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Preface 


The increasingly widespread interest in, and application of, 
factor analysis in recent years has made it desirable that the 
method be introduced earlier in the student’s statistical training 
than has usually been the case. There are a number of excellent 
advanced treatments of the subject such as Thurstone (1947c) ,* 
Holzinger and Harmon (1941), Thomson (1951), and ‘Cattell 
(1952a). These treatments have been found difficult by many 
otherwise competent students because of the mathematics and 
notation involved. It is hoped that this book will serve as an 
introduction to the subject and as a steppingstone to these more 
advanced texts. An attempt has been made to introduce and 
develop the concepts of factor analysis in logical order. 

In general, a background in high school mathematics only 
is called for, although a course in analytical geometry is helpful. 
The theory of matrices, the mathematical basis for factor analysis, 
is introduced in Chapter 3. The student should also have a back- 
ground in statistics, through multiple correlation, and sufficient 
background in some content area to appreciate the need for the 
application of the factorial methods to that area. 

A wide variety of methods and procedures is available in 
factor analysis. Only those methods which, in the author’s ex- 
perience, have been found to be the most generally useful and 
practical have been included in this introductory book. Possible 
exceptions are some of the methods in Chapter 8 (e.g., direct 
rotation) which are presented as much for their theoretical in- 
terest as for their practical value. Exercises and answers have 


1See Bibliography at end of book for complete reference. 
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been provided so that the book can be used for self-instruction 
by those persons who do not have access to an organized class. 

The writer has found it advisable to have his students obtain 
a broad background in the applications and methods pertinent 
to their fields of interest by having them read widely in the liter- 
ature, and an extensive bibliography has been included to serve 
as a guide to recent contributions. 

Acknowledgments to various authors and publishers have been 
made through the book. Special acknowledgments, however, are 
made to the University of Chicago Press, the American Psycho- 
logical Association, and publishers of the following journals for 
permission to quote from their publications: Journal of General 
Psychology, Journal of Genetic Psychology, Journal of Experi- 
mental Education, Journal of Personality, Psychometrika, Journal 
of Comparative Psychology, American Journal of Psychology. 

Many people have contributed to the development of this 
book. The writer is especially grateful to Professor J. P. Guilford 
for instruction, encouragement, and editorial guidance; to Pro- 
fessor L. G. Humphreys for first interesting him in factor analysis 
and for reading portions of the manuscript; to Professor L. L. 
Thurstone for his many original contributions to factorial 
methods; and to his students for constant stimulation. He owes 
a special debt of gratitude to his wife, Dorothy A. Fruchter, for 
continued interest and assistance in the preparation of the manu- 


script and drawing of the figures and to Annette Stewart for typing 
assistance. 


Foreword 


Since the time when that great innovator, Charles Spearman, 
presented to the world his theory and method of factor analysis, 
the use of factor-analytic procedures has increased at a rapid rate. 
This trend was given a sharp acceleration in the United States 
by Thurstone’s generalization and extension of factor theory and 
methods with the publication of his important contribution, 
Vectors of Mind, in 1935. 

More and more it is being realized that the end product of 
science is in the form of statements concerning the interrelation- 
ships between things. It is also realized that the most fruitful 
type of concept of the things related is in the form of variables 
or dimensions. The major problems are therefore correlation 
problems. Where the appropriate variables are readily observ- 
able or easily inferred from objective data, one may proceed to 
the discovery of the interrelationships. In psychology and the 
social sciences, where the number of potential variables is very 
large and the useful variables that we need for economical and 
dependable descriptive purposes are overlaid with multiplex 
manifestations, the demand for some method that will facilitate 
the discovery of those underlying variables is very great. It is 
in the fulfillment of this objective that we find factor-analysis 
methods to be of greatest value. 

The factorial methods were instituted originally for the study 
of the basic or underlying variables needed to account for in- 
dividual differences in measurements of abilities or aptitudes in 
terms of test scores. It was eventually realized that the methods 
could also be fruitfully used to discover underlying dimensions 


vu 


Vili Foreword 


of temperament, interests, and attitudes—in fact, dimensions un- 
derlying any aspect of personality, including morphological and 
physiological traits. The extension of factorial methods to vari- 
ations known as the Q-technique and the P-technique has offered 
promising tools for the understanding of personality structure 
in individuals. This has appealed to some clinical psychologists 
who seek a more rigorous way of approaching general principles 
of personality which keep close to the study of individuals. 
Factor theory is as yet the most promising source of a mathemat- 
ical model for a comprehensive, rigorous theory of personality 
structure. : 

Problems in experimental psychology have also been attacked 
from the factor-analysis approach, including the problem of sen- 
sory attributes and problems of the variables of emotion, humor, 
and esthetic reactions. Problems of social behavior and of in- 
dustrial morale have been approached through factorial methods. 
Outside the field of psychology there have been many applica- 
tions; to problems of biological growth, to voting behavior of 
legislators and judges, to educational achievement, to criteria of 
job, performance, and to allergy reactions. 

As courses for the instruction of students in the ideas and 
ways of factor analysis have been introduced one by one, the need 
for a simple introduction has increased. In writing this volume 
Professor Fruchter has kept in mind the beginner in the study of 
factor analysis. Fruchter's extensive experiences in the methods 
have fitted him well for this assignment, The clear and succinct 
exposition of theory and methods will undoubtedly be welcomed 
especially by those who have not been favored by the opportunity 
to acquire the mathematical background necessary for the expert’s 
grasp of the subject. The selection of techniques to be presented 
seems to include those that are most likely to be needed by the 
average investigator who utilizes factor analysis in his research. 
The step-by-step demonstrations should be relatively easy to fol- 
low. It is anticipated that this volume will be useful as a ready 
handbook for the investigator as well as a textbook for the be- 
ginner in factor analysis. 


J. P. GUILFORD 
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Introduction 


Tue Nature or Factor ANALYSIS 


|. THE continuing effort of science to establish 

theoretical systems which will account for observed phenomena, 
identification and measurement of basic variables are of para- 
mount importance. Almost every field of science has set up a 
number of classificatory categories which can be described and =) 
measured. Factor analysis starts with a set of observations ob- 
tained from a given sample by means of such a priori measures. , 
It is a method of analyzing this set of observations from their inter- | 
correlations to determine whether the variations represented can `\ 
be accounted for adequately by a number of basic categories + 
smaller than that with which the investigation was started. Thus 
data obtained with a large number of a priori measures may be 
explained in terms of a smaller number of reference variables. ` 

Psychologists and educators, for example, have tried to gen- 
eralize concerning the supposed basic dimensions of human be- 
havior and have adopted such well-known trait-concepts as mem- 
ory, imagination, judgment, etc. In the field of mental testing 
there has been an outpouring of tests of general intelligence and 
of specialized abilities such as clerical, mechanical, or medical 
aptitude. With regard to intelligence, factor analysts might be 
interested in determining whether the individual differences repre- 
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sented by intelligence-test scores are attributable to a single source 
of variation or whether they represent the operation of a com- 
bination of several mental traits, such as reasoning, verbal, and 
numerical abilities which individuals may possess in varying com- 
binations. They might also inquire whether the various types 
of specialized ability that have been distinguished are really dif- 
ferent or whether they represent conglomerations of a set of basic 
factors which run through them, in terms of which they can all 
be described, and which would make their similarities and differ- 
ences more apparent. Similarly, in the area of clinical psychology 
a large number of diagnostic categories have been adopted by psy- 
chiatrists, clinical psychologists, and others to classify neurotic and 
psychotic behavior disorders. The role of factor analysis here 
might be to determine whether these classification systems can be 
verified with empirical data for specific populations, or whether 
they overlap to such an extent as to call for a reordering and 
redefinition of the sets of diagnostic categories. 

Factor analysis provides a mathematical model which can be 
used to describe certain areas of nature. A series of test scores OY 
other measures are intercorrelated to determine the number of 
dimensions the test space occupies, and to identify these dimen- 
sions in terms of traits or other general concepts. The interpreta- 
tions are done by observing which tests fall on a given dimension 
and inferring what these tests have in common that is absent from 
tests not falling on the dimension. Tests correlate to the extent 
that they measure common traits. By observing and analyzing 
the pattern of intercorrelations, the Operation of one or more 
underlying traits or other sources of common variance is inferred. 

It is thus by establishing the basic sources of variance in a 
field of investigation and determining the nature of each measure 
in terms of basic categories that we can know what types of varia- 
tions the tests (or other variables) are measuring, the interrela- 


tionships of these measures, and what needs to be 
or improve them w 


purposes. 

It has been pointed out by Royce ( 
for research programs might be, first, t 
ures in a field of investigation 


done to modify 
hen they are not entirely satisfactory for our 


1950c) that a proper order 
o use a set of a priori meas- 
and factor analyze them to deter- 
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mine the basic traits or other sources of variance operating; sec- 
ond, to study these factors, one at a time, by the techniques of 
analysis of variance to determine how they are affected by differ- 
ent experimental conditions or how they vary among groups that 
differ with respect to age, sex, education, or other pertinent back- 
ground variables; and lastly, to study them experimentally in the 
laboratory for specific groups under carefully controlled condi- 
tions. While this order need not be invariable it does help to 
point up the proper function of each type of approach. In factor 
analysis the “individual differences” represented by a large num- 
ber of measures, that are given to a single population, usually at 
one time under a standard set of conditions, are studied to detect 
possible common sources of variation or variance. In analysis of 
variance a single measure is administered over a series of occasions 
and conditions to determine the significance of group differences. 
The individual differences by this latter approach are often used 
as an estimate of “error variance” against which to evaluate the 
group differences. 

It is quite understandable that some psychologists have used 
the results of factor analysis in the area of cognitive tests as support 
or disproof of theories of intelligence. The results of factor anal- 
ysis can serve only as indirect evidence for this purpose, however, 
since factor analysis attempts to account statistically for differences 
in traits among individuals rather than for the mental organiza- 
tion within any one individual.’ The fact that the observed 
differences can be accounted for by weighted sums of measures 
of the reference variables obtained from a factor analysis should 
not be construed to mean that any one individual’s behavior is 
a resultant of the additive combination of these hypothetical traits. 
They merely serve to account mathematically for observed or pre- 
dicted individual differences. Moreover, as methods of measure- 
ment and prediction become more refined and exact, factorial 
methods based on multiplicative or more complicated relation- 
ships for combining variances undoubtedly will be developed. 
While nonlinear relationships may be more appealing logically, 


1 For a possible exception to this see the descriptions of O- and P-techniques 
in Chapters 10 and 11. 
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nonoverlapping traits (Figure 1.1d); If there were a significant 
correlation between the reading-comprehension and addition 
tests, there would be two alternative possibilities, and we would 
need the correlations with still another test to decide which is 
the more likely. The overlap could be either largely on the same 
trait as that between the arithmetic-reasoning and reading-com- 
prehension tests (Figure 1.1e), or it could be wholly or in part on 
other factors (Figure 1.1f). In the former case, a single factor 
would account for most of the observed intercorrelations, whereas, 
in the latter instance, it would be necessary to assume more than 
one common factor. 

In this way, if tests were added one at a time and the degree 
of their correlation with each previous test observed, the number 
of relationships to be held in mind with each added variable 
would increase rapidly, and more systematic ways of summarizing 
these relationships would be needed. The various methods of 
factor analysis have been devised for that purpose. 


SPEARMAN’sS TWwo-FACTOR THEORY 


Spearman was one of the first to attack the factor 
problem. At first he worked with groups of four tests. Observing 
that tests of abilities tend to have positive intercorrelations, he 
was able to show mathematically that the intercorrelations as rep- 
resented in Table 1.1 could be, but were not necessarily, accounted 


TABLE 1.1. INTERCORRELATIONS OF FOUR TESTS 


for by a single source of variation (or factor) if the coefficients 17 
any combination of two columns (avoiding the cells in the pri” 
cipal diagonal, which are vacant) are proportional. Thus, for 


columns 1 and 2, to satisfy this “criterion of proportionality,” We 
have the equation 
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731 Tai 
— = Gly a = Tastes (1.1) 
732 T42 
For columns 1 and 3 
721 Yar 
=- OF foia = Paster (1.2) 
r23 743 
For columns 1 and 4 
721 731 
—=— OF Pairs, = oga (1.3) 


r24 134 


Similar equations can be written for columns 2 and 3, 2 and 4, 
and 3 and 4, These would, however, merely duplicate the equa- 
tions already written. When the equalities of equations 1, 2, and 
3 exist, the criterion of proportionality is said to be satisfied, and 
the four tests are assumed to have one factor in common. An- 
other way of expressing equations 1.1, 1.2, and 1.3 would be: 


rairaa — 130741 = 0 (1.12) 
T2143 — Teats1 = O (1.22) 
21734 — Tears = O (1.32) 


These are Spearman's famous tetrad differences. Divergence of 
these differences from zero, within the limits of sampling error, are 
tolerated. If more than four tests are being investigated for this 
criterion, all combinations of four tests are examined, one at a 
time. This becomes quite laborious, of course, if there are many 
tests. Ten tests would yield 252 different tetrads, 20 tests would 
yield 2,907. Any one of n tests is involved in y(n — 1)(n — 2) 
(n — 8) different tetrad differences and any one correlation co- 
efficient in (n — 2)(n — 3) different tetrad differences. 

Spearman and others found a considerable number of ability 
tests whose intercorrelations satisfied the proportionality criterion 
with sufficient closeness that their intercorrelations could be ac- 
counted for by a single factor. ‘This factor was called g, the gen- 
eral-intellectual factor. Spearman’s two-factor theory postulated 
that every test which satisfies the criterion of proportionality con- 
tains two factors, g and s. His g-factor is a general factor common 
to all intellectual tests. The s-factor is specific to each test and 
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represents that portion of the reliable variance of a test that does 
not correlate with other tests. Spearman developed the hypothesis 
that g is a function of heredity, whereas s-factors represent the 
acquisition of specific learnings and experiences. Those tests that 
did not satisfy the proportionality criterion were discarded from 
the battery of tests of g or general 
intelligence. 

More efficient methods of ex- 
amining tables of intercorrela- 
tions for proportionality or “hier- 
archy,” as it is sometimes re- 
ferred to, have been developed. 
However, it is now generally 
realized that examining a table 
of correlations to determine 
whether it can be accounted for 
Ficure 1.2 Schematic representation by a single factor is a special case 
of Spearman's Peoi theory for of the more general problem of 

determining the minimum num- 
ber of common factors that are needed to account for all of the 
relationships. Spearman’s technique is now largely of historical 
interest, but it should be added that Spearman has had a strong 
influence on British factor analysts, and they and their followers 


TABLE 1.2. TABULAR REPRESENTATION OF SPEARMAN’S 
TWO-FACTOR THEORY 


Factor 


usually look first for a general factor in any table of intercorrela- 
tions. 

A schematic representation of Spearman’s two-factor theory 
for six tests is shown in Figure 1.2. Table 1.2 presents the same 
information in tabular form. The numbers inside both the circle 
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and ovals are the correlations (or loadings) of each test with g. 
The numbers in the portion of the ovals not included in the circle 
are the loadings of the tests on their specific factors, assuming 
perfectly reliable tests. Each of these loadings must be squared 
to determine the proportion of the total variance of the test rep- 
resented on the general factor.* For a perfectly reliable test f, 
the total variance in standard scores can be represented by equa- 
tion 4, where g stands for the loading on the general factor and s 


g +s; = 1.00 (1.4) 


stands for the loading on the specific factor.* Thus, for test 4 in 
Table 1.2, .82 + .6? = 1.00. 

Seven factors, one general and six specific, are represented in 
Table 1.2, but only one of them, g, is required to account for the 
correlations. The correlation between any two tests is dependent 
on the amount of g-factor they have in common and is obtained 
from the product of the two loadings. The correlation of tests 1 
and 2 (in Table 1.2) is .7 X -5 or .35, that between tests 1 and 3 
is .7 x .3 or 21, etc. In this way the entire table of correlations 
can be obtained and is shown in Table 1.3. The numbers in 
parentheses (in the diagonal cells) are obtained by multiplying 
the loading of each test on g by itself. This value is known as 
the communality of the test, represented by the symbol h?, and 
indicates the proportion of the total variance of the test which 
is held in common with the general factor. 

Bare aca : 
2 As in all test theory, variance is equa 


Factor loadings represent correlation an 


variance, z P : 
3 The g-loading of test j can be calculated from its correlations with 2 other 


tests k and Z by the formula 
2a aie 
si = Tk 


is satisfied for these tests. In a matrix of correla- 
j can be estimated by the formula 


Ja = Af 
£ NT 24; 


elations (where each occurs twice and the diagonal 
lations in row j; A’; = sum of the squared 


Cf. Spearman (1927), appendix. 


1 to the square of the standard deviation. 
d must be squared to be converted to 


if the proportionality criterion 
tions, the g-loading of variable 


where T = sum of all the corr 
cells are empty); 4, = sum of all the corr 
Correlations in row j; 4;2 = the square of A;. 
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TABLE 1.3. 


INTERCORRELATIONS OF SIX TESTS REPRE- 


SENTED IN FIGURE 1.2 AND TABLE 1.2 


HOLZINGER’s Bi-FACTOR THEORY 


More recently Spearman and his adherents have 
realized that those tests which do not satisfy the criterion of pro- 
portionality and which Spearman has termed “‘disturbers” may be 


Figure 1.3 Schematic representation of 
Holzinger’s bi-factor theory. 


retained in the correlation ma- 
trix if it is recognized that 
some of the tests may have a 
factor in common, in addition 
to the general factor that is not 
common to all of the tests. 
These factors, common to 
groups of tests, are termed 
group factors. Holzinger’s bi- 
factor method, which is a vari- 
ation of Spearman’s two-factor 
method, obtains a general and 
one or more group factors. 


The bi-factor theory may be represented schematically, as in 


Figure 1.3. 


Table 1.4 gives the same information in tabular 


form. The variance for test j in standard scores, if it is assumed 


TABLE 1.4. TABULAR REPRESENTATION OF HOLZINGER’S 
BI-FACTOR THEORY 


Factor 
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to be perfectly reliable, is given by equation 5, where c is its load- 
ing with a group factor. 

SF + G? + 5? = 1.00 (1.5) 

The correlation between two tests is determined by the extent 


to which they are loaded on the same factors and can be calculated 
from the cross-products of their loadings according to equation 6. 
Tye = Bin F jiki F Cjoteg bes CjrCkr (1.6) 
The correlation between tests 1 and 2 (in Table 1.4) is equal 
to (.6 X .4) + (.3 X .4) or .36, and the correlation between tests 
l and 5 is equal to (.6 x .6) + (.3 X.0) + (.0 X .7) or .36. The 
correlations among all the other tests may be similarly computed. 
In this example nine factors are used, but only three of them, 
the general and two group factors, enter into the correlations 
among the six tests. An advantage of the bi-factor method over 
the two-factor approach is that it does not insist on a single com- 
mon factor but allows tests which do not fit the proportionality 
criterion to be retained by placing part of their common variance 
under group factors. + 
If one were interested in attempting to infer the psychologi- 
cal nature of the general factor, he would try to determine what 
all the tests loaded on the factor had in common that was not 
shared by the tests which did not fit the proportionality cri- 
terion. In addition, he might also try to infer what the tests ap- 
pearing on a group factor have in common which is not shared 
with tests that do not appear on the group factor. More confi- 
dence would be placed in the identification of the second group 
factor in the example of Figure 1.3, since there are three tests to 
examine for a common element, whereas for the first group factor 
there are only two tests to be considered. It might be possible to 
hypothesize two different traits that these two tests seem to measure 
in common. In the absence of a third test to support one hy- 
Pothesis or the other, it would be difficult to decide which is more 
likely the common element. Among the group factors which have 
been identified by investigators using the bi-factor method are 
verbal ability, numerical ability, mechanical ability, attention, 
imagination, and such personality factors as perseveration, will, 


and oscillation. 


CHAPTER 2 


Cluster Analysis 


C LUSTER analysis, a simple form of correla- 
tional analysis, is a useful technique in itself and can also be used 
to gain further understanding of the purposes and nature of factor 
analysis. There is one type of correlational matrix which would 
give the same results whether analyzed by cluster analysis or factor 
analysis. ‘The case in which the variables are all “pure” (i.e., each 
would appear on only one common factor) gives similar results 
by the two methods. Cluster analysis is simpler to perform. 
One of the differences between cluster analysis and factor analysis 
is that in the former each variable as a unit usually is placed in 
a cluster, whereas in the latter, different portions of the variance 
of a variable may be assigned to different factors. 

There are a number of other situations in which it would be 
appropriate to cluster analyze a matrix of intercorrelations. 
When dealing with a large unfamiliar table of correlations which 
is to be factor analyzed, it is sometimes helpful to do a prelim- 
inary cluster analysis to become familiar with the variables and 
the ways in which they are related. It would also be advisable 
to do a cluster analysis for tables of correlations on which, because 
of the small N, large number of variables, or desire to have each 
variable appear in only one category, the additional computational 
labor of a complete factor analysis is not justifiable. Cluster anal- 
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ysis may also be used as a preliminary step in some of the group- 
ing methods of factor analysis, such as the multiple-group method 
(see Chapter 6). 


CLUSTER ANALysis WITH Fictitious DATA 
Several methods of cluster analysis have been sug- 
gested (Cattell, 1944a; Tryon, 1939; Holzinger and Harmon, 
1941; and Peatman, 1947). The procedure presented here uses 
Tryon’s modification of Holzinger and Harmon’s B-coefficient 
(coefficient of belonging). 
Given the intercorrelations of eight tests shown in Table 2.1, 


we may ask whether there are any clearly defined clusters of vari- 


TABLE 2.1. TABLE OF INTERCORRELATIONS 


Variable 


1 

2 
3 
4 
5 
6 
7 
8 
= 


te the operation of underlying factors. 
relationships it is convenient, first, to 
list the correlations for each variable in order of size, as has been 


done in the rows of Table 2.2. Thus, riz is between .45 and 49, 
Tıs is between .35 and .39, etc. Examination of Table 2.2 indi- 


N COEFFICIENTS FOR EACH 
ER OF SIZE 


ables which would indica 
In order to study the inte 


TABLE 2.2. CORRELATIO 
VARIABLE LISTED IN ORD 


Variable |* 


ONDUA WN 
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cates that the highest correlations are between variables 2 and 5 
and variables 3 and 6. It would probably be well to start a cluster 
of variables with one of these correlations. The work of obtain- 
ing a B-coefficient * for a cluster is best done in a table like Table 
2.3. The B-coefficient gives the ratio of the average intercorrela- 
tion of the variables in a cluster to their average correlation with 
the variables not included in the cluster. A B-coefficient of 1.00 
would indicate that the variables in the cluster correlate no more 
highly among themselves than they do with the variables outside 
of the cluster. 

It is customary to start a cluster with the two variables which 
correlate highest and to keep adding variables until the B-coeffi- 
cient shows a marked drop.? In Table 2.3 let us start our cluster 
with variable 2 and add variable 5 to the cluster (the variable 
added to the cluster is given the general designation of variable 
?). The work is outlined in the following steps: 


1. In column 1 of Table 2.3 list the variables in the cluster. 

2. In column 2 indicate the sum of the correlations in column 
i in the table of intercorrelations (when starting a cluster with two 
tests this includes the sums of the columns of both variables). 

3. In column 8 enter the sum of the correlations between the 
variable being added to the cluster and those already in the cluster. 

4. In column 4 enter the sum of all the correlation coefficients 
in the cluster. 

5. In column 5 enter the sum of the correlations of the vari- 
ables in the cluster with the variables not in the cluster. 

6. In column 6 enter the number of variables in the cluster. 

7. In column 7 enter the number of the intercorrelations 
among the variables in the cluster. 

8. In column 8 enter the number of remaining intercorrela- 


tions (i.e., between the variables in the cluster and those not in 
the cluster). 


1See Holzinger and Harmon (1941, Pp. 23-34). 


2 Holzinger and Harmon (1941, p- 27) have arbitrarily set the minimum sig- 
nificant value of a B-coefficient at 1.30, although, of course, the goal is to obtain the 
highest possible set of meaningful B-coefficients, 
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R 9. In column 9 enter the average of the correlation coefficients 
in the cluster. 
10. In column 10 enter the average of the correlations be- 
tween the variables in the cluster and the remaining variables. 
11. In column 11 obtain the ratio of the average of the cor- 
relations in the cluster (column 9) to the average of the remaining 
correlations of the variables in the cluster (column 10). This is 


the B-coefficient. 


; For the cluster consisting of variables 2 and 5, the B-coefficient 
is equal to 2.40. Is there another variable which can be added 
to the cluster and not result in a greatly reduced B-coefficient? 


TABLE 2.3. WORK SHEET AND SOLUTION FOR B-COEFFI- 
CIENTS IN CLUSTER ANALYSIS 


(5) (6) 


(4) 
Sum of Correlations |Sum of Correlations of kzNumber | (k= 1) = 
2 


‘among Voriables _|Voriables in Cluster of 

in cluster (3) +(4)| with Voriobles not |Voriobles [Number of [Remaining |Iation in [Remaining (9) + (10) 
in Cluster = (2) ~2(3) | in the intercorre- intercorre- [Cluster [intercorre 
+15) from Preceding |Cluster (4) =(7) |iohons 
Line (5) + (8) 


a (2) (3) 
Preceding] Sum of |Sum of 
Cluster +;|with n [Correlations 
Vorigbles [between / 
lond Voriables| from Preceding 
already in | Line 
Cluster 


B25) 224+ 

279=5.03 
B(2,5,8) | 2.73 1.21 me 
8(25,8,1)] 3.12 1.56 = 
B(3,6) Tasos «72 ” 


267-519 
8(3,6,4) |278 | 4.21 


B(3,6.47) | 2.85 1.53 
ee ķ 


variable 2 of Table 2.2 it may be ob- 
highest correlation. The same 
8 to the cluster gives the 


Looking along the row for 
served that variable 8 has the next 


is true for variable 5. Adding variable 
results shown in the next line of Table 2.3. The B-coefficient is 


slightly larger, and variable 8 is retained in the cluster. 

Variable 1 is the next variable to be considered for inclusion 
in the cluster, since it has high correlations with all three variables 
already in the cluster. Adding variable 1 results in a B-coefficient 
of 2.39. This is still a satisfactorily large B-coefficient, and vari- 
able 1 is retained in the cluster. 

None of the other variables seem to have higher correlations 
With the variables in the cluster than with the remaining variables, 
and the first cluster may be considered completed. To start the 
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next cluster we search for the highest correlation coefficient not 
included in the first cluster. This is T35, and the next cluster is 
started with variables 3 and 6. Their B-coefficient is 2.31. 
Variable 4 has appreciable correlations with variables 3 and 
6 and is added next. It raises the B-coefficient to 2.35. Variable 


p Cluster I 
Cluster 1] == — = — 


Variable 1 2 3 4 5 6 7 8 


FIGURE 2.1 Correlational profiles of the variables in Table 2.1. 

7 in considered next. It reduces the B-coefficient to 2.14 (see 
Table 2.3), but since this is a satisfactorily high coefficient, vari- 
able 7 is retained in the cluster. We have now assigned our eight 
variables to two clusters. Next, if desired, graphs may be drawn 
showing the profile of the correlations of each variable with the 
other variables. Variables which are similar should have similar 
profiles, and inspection of the graphs will exhibit the similarity. 
(See Figure 2.1.) 

If it is desired to identify the nature of the clusters, the remain- 
ing step would be to infer what trait the variables in each cluster 
are measuring in common which is absent from or diminished 
in the tests not in the cluster. The process of describing or nam- 
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ing the clusters is one of logical inference and is in the nature of 
an interpretation or hypothesis which may be subjected to verifica- 
| tion by further investigations. 


PROBLEMS 
CorRELATION MATRIX 


N = 250 


Variable 


l. Obtain the clusters for the above correlation matrix by the 


B-coefficient method. ; 
2. Draw a correlational-profile graph of the 8 variables. 


CHAPTER 3 


Mathematics Essential 
for Factor Analysis 


Toe MATHEMATICAL operations of factor analysis 
are based upon matrix theory, and in order to enable the student 
to follow them the fundamentals of matrix algebra and matrix 
geometry will be presented. 


MATRIX ALGEBRA 


Definition of a Matrix. A matrix is a rectangular 
or square arrangement of numbers in a table, regardless of what 
the numbers represent. A table of correlation coefficients may be 
considered a matrix. The numbers in a matrix do not have a 
predetermined arithmetical relationship to each other, and the 
matrix, considered as a whole, does not have a specific value at- 
tached to it. Table 3.1 shows a 4 X 4 square matrix. It is sup- 
posed to be written with double vertical lines on both sides to 
distinguish it from a determinant, but it is frequently written with 
single vertical lines. The letter a is used in this illustration to 
represent the numerical entry (called an element of the matrix) in 

1The mathematics of matrices was developed primarily to solve systems of 
equations. 
18 
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TABLE 3.1 
1 2 3 4 
1 aa Ta "n "14 
2 || 721 %22 223 724 eK 


a 
9 
= 
a 
D 
N 
Q 
a 
ò 
a 
a 
> 


each cell. It is not necessary to number each row and column of 
a matrix, but, where it is desired to identify each element, it can be 
done by giving first the row in which it is located and then the 
column. Thus, the symbol asz represents the numerical entry in 
the cell at the intersection of the third row and second column. 
An entire matrix may for convenience be represented by a letter, 
and the matrix in Table 3.1 is represented by the letter A2 A 
matrix with m rows and n columns is said to be of order m X n. 


A square n x n is of order n. 

The transpose of a matrix is a second 
ained from the rows of the first 
hematically in Figure 


Transpose of a Matrix. 
matrix whose columns are obt 
matrix. This relationship is represented sc 


3.la. 
The same result could have been obtained by using the col- 


umns of matrix A for the rows of its transpose, as indicated in 
Figure 3.1b. The transpose of a matrix is indicated by the addi- 
tion of a prime (’) to the letter of the matrix. 

Obtaining a transpose matrix is also illustrated by a numerical 
example in Table 3.2a and by a literal example in Table 3.2b. 

A more generalized definition of the transpose of a matrix is 
that it is a matrix with the rows and columns of the original 
matrix interchanged. Thus reading down column 3 of matrix B’ 
in Table 3.2a gives the same values as reading across row 3 of 
matrix B, and reading across TOW 


3 of matrix A’ gives the same 
values as reading down column 3 of matrix À. 
~ 


2 Representing a matrix by 4 letter is an exampl t 
extensive use of atx notation in most books on factor analysis puts them beyond 
the comprehension of many students who might otherwise understand the con- 
cepts. Limited use of matrix notation will be made in this book. 


e of matrix notation. The 


oOo op WH = 


Gubsu n= 


Matrix A 


OunkhwWwWn = 


%61 [62 O63 


Matrix Ç 


awn WH = 
nr ý 
—$$_$______ wy) 

wn 


Introduction to Factor Analysis 


sti 


(Transpose of Matrix A) 


FIGURE 3.la 


123 4 6 6 
—_—S 


— 
Matrix A’ 
Ficure 3.1b 
TABLE 3.2 
3 
35 
Be] 1234 8 6 | 
9 1 t a ai JD 36 
6 2 6 A 8 18 oS 
F If 65 3 9 aaa 
8 
Matrix B' 
(a) 
1 2@ 3 & 6 
1jja 


Matrix C’ 
(5) 
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Determinants. A determinant is similar to a matrix in that it 
too is a table of numbers. It differs from a matrix in that it (1) is 
always square, and (2) has a definite numerical value associated 
with it. An n X n determinant is said to be of the order n. A 
determinant is written with single vertical lines on the sides to 
distinguish it from a matrix, which is written with double vertical 
lines. Table 3.3 illustrates two determinants, one of order 2 and 


one of order 4. 


TABLE 3.3 
Oy Ta Sa Cia 
3 M2 aa %2 %23 724 
On 032 Gy %2 33 734 
a, 


Oa Oaz a3 


A Fourth- order 


A Second-order - 
Determinant 


Determinant 
; This is not the place to give the rules for numerically evaluat- 
ing determinants. They can be found in any textbook of college 
algebra.* The numerical value of a second-order determinant is, 
however, easily obtained. It is given by the expression 411422 — 
4214), where the numerical values are arranged as in the 2 X 2 
determinant in Table 3.3. Of course, if the product d11422 is equal 
to the product a21412, the determinant is equal to zero and is said 


to “vanish.” 

Although matrices and determi- 
some purposes to consider 
to evaluate it according to 


Determinant of a Matrix. 
nants are different, it is desirable for 
a square matrix as a determinant and 
the rules for evaluating determinants. 


Minors of a Determinant. If one row and one column of a 
ated, the remainder is a deter- 


determinant of order n are elimin ; 
minant of order n — l. This determinant of order n — 1 is 
referred to as the first minor of the larger determinant. If a first 
minor were obtained, for example, from the 4 X 4 determinant 
in Table 3.3 by striking out the second row and third column, it 
Would leave the third-order determinant shown (Table 3.4), 
~ 

rse in College Algebra, The Mac- 


_3See, for instance, W. B. Ford, A Brief Cou 
millan Co., New York, 1932, Chapter XIV. 


N 
w 
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TABLE 34 


which would be referred to as the first minor, a23 (the intersection 
of the row and column crossed out). 

For a 4 X 4 determinant there are 16 possible first minors. 
For an n X n determinant there are n? first minors. 

Similarly, if any two rows and any two columns of an n X ” 
determinant are eliminated, we have remaining a determinant of 
order n — 2. Such determinants are known as the second minors 
of the n X n determinant. The 2 X 2 determinant in Table 3.3 
is one of the second minors of the 4 X 4 determinant. 


Rank of a Matrix. One type of problem approached by factor 
analysis may be stated as follows: given an n X n matrix of cor- 
relations, are n independent reference variables needed to account 
for the relationships represented by the correlation coefficients, 
or can the relationships be accounted for by a linear (i.e., addi- 
tive, weighted) combination of fewer than n reference variables? 
‘The rank of the correlation matrix is the number of reference 
variables needed to account for the intercorrelations. 

One way of determining the rank of a matrix is to find the 
order of its highest nonvanishing minors. For example, if the 
determinant of a 4 X 4 matrix when evaluated does not vanish, 
its rank is 4. If the determinant is equal to zero, the matrix is 
of rank 3 or less. If one or more of the first minors of this 4 X 4 
matrix do not vanish, the rank of the matrix is 3. If all 16 first 
minors (which are of order 3) vanish, the rank of the matrix is 
less than 3. The mathematics of factor analysis is primarily 
adapted to testing the ranks of matrices.* 


4It is of historical interest to note that Thurstone (1947c, p. vi) defined the 
multiple-factor analysis problem in these terms. He writes: “In 1931 I decided 
to investigate the relation between multiple-factor analysis and Spearman’s tetrad 
differences. When I wrote the tetrad equation to begin this inquiry, I discovered 
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Some Common Types of Matrices. 


A. Square Matrix 
A square matrix is one having the same number of columns 


and rows. 


awn 
oon 
un = 


A Square Matrix 


B. Symmetric Matrix 
A symmetric matrix is a square matrix with equal values 


in corresponding positions relative to the principal diag- 
onal (which extends from the upper left-hand corner to 
the lower right-hand corner and includes cells a1, 4&22, 


ass, etc.). Thus matrix A is symmetric if a12 = G91, o3 = 


ass, and di3 = 431- 
ai Fie 43 
as, Foo %3 || =A 
93, %32 733 


A Symmetric Matrix 


Matrix B is also a symmetric matrix. 


a2 2364 

3.9.8 =B 

4.8.5 

ranspose matrix are equal. 


A symmetric matrix and its t 
(Two matrices are equal if their corresponding elements 


are equal.) A table of jntercorrelations is, of course, a 
symmetric matrix, usually with no numerical entries in 


the principal diagonal. 


———. 

that the tetrad was merely the expansion of a second-order minor and the rela- 
tion was then obvious _ If the second-order minors must vanish in order to 
a ird-order minors vanish in order 


then must the th 


€stablish a single common factor, ; 
and so on?’ 


t : 
© establish two common factors, 
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C. A Diagonal Matrix 
A diagonal matrix is one which has values greater than 
zero in its principal diagonal while all other elements are 


TOS. 
Zeros A o o 
O a 0 
0 0 a% 


ADiagonal Matrix 
D. The Identity Matrix 
A diagonal matrix that has principal-diagonal values all 
equal to 1, while all other values are zeros, is called an 
identity matrix and is often represented by the letter I. 
Its role in matrix mathematics is somewhat analogous to 
the use of the number J in arithmetic. 


The Identity Matrix 


Matrix Multiplication. Matrices may be multiplied. The 
rules for matrix multiplication are more complicated than for 


nm = 
one 


= 


1 2 1 2 3 
l i TTT 
2 | ——> 2 cles Ei Te 
3 | ———>- 3 + Fe ya 
4 || ———> 4 4 a Ei q 
A c 


FIGURE 3.2 Illustration of the row-by-column rule of matrix multiplication. 


N 


w 
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arithmetic multiplication, but they can be mastered with a little 
application. The first respect in which matrix multiplication 
differs from arithmetic multiplication is that the order in which 
the multiplication is done must be specified. “Thus, the product 
of matrices A-B is usually not equal to the product of matrices 
B-A. In the former case matrix A is said to be post-multiplied 
by B and in the latter case matrix A is said to be pre-multiplied 
by B. 

In order for it to be possible to post-multiply matrix A by 
matrix B, the number of columns of the former must be the same 


TABLE 3.5. MATRIX MULTIPLICATION 


1 2 
1| oy r 12 3 
2)! az 022 n 1 || bn bie 413 
3]] o3 %e 2 || ba b22 423 
A| aa %a2 
A 8 


) 


(ay, biz + O baa) (ayy by3 + M2 b23 
) 


(ayy bis + aga bar? 
6. 


laz biz + 2 be ) (aay 443 + a22 as 
(05, biz + a32 S22) (31 biz + 932 623) 
(a4, 642 + aaz b22 Vlog, 643 + 24223) 


lazy by, + a22 bar? 
lay by + agba) 
(04,54 + O49 bay) 


c 
(a) Literal Example 


i 2 1 
| oe 1||.9 .4 à 
Blin 2 ý Bl ge fli ld 
3]| .4 .8 
ajs .2 
A B 
1 2 3 
q || cea fl 52 
2|| 60 .27 .39 
3 || 52 .24 68 
a || .49 22 .29 
c 


(b) Numerical Example 
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more complex process than division. Since, however, it is reid 
sary to obtain the inverse of a matrix in several problems in — 
analysis, one method will be described.* This method is base 
upon the following principles: 


1. There are certain operations that can be performed on the 
rows of a given matrix A by which it may be reduced to the iden- 
tity matrix.® Ea À 

2. These same operations applied in the same order fo, Ene 
rows of an identity matrix will yield the inverse (A?) of the given 
matrix A. 


Computation of the inverse of matrix A is illustrated in the 
following steps: 


TABLE 3.6 
2 A 7 1.0 .0 © 
6 S 01.0 .0 
a 8 32 Oo .O 1.0 
cA I 


l. Divide each number in the first row of matrix A by the 
value in the principal diagonal for that row (.2). Divide the num: 
bers in the first row of matrix I by the same value (.2). This 
gives the following values in the two matrices: 


10 2.0 35 50 0 0 
& & 5 0 10 0 
3 8 #3 0 o 1.0 


2. Now that we have a 1.0 at the top of column 1 for the 
matrix at the left, we wish to transform the remaining values an 
the first column into zeros, in order to get it into the identity 
matrix form. This can be done for the second row by subtracting 
0.6 of each value in the first row from the corresponding value 1? 


8 Other methods are described in the following references: Thurstone aa 
Tucker (1938), Fruchter (1949d), Deemer (1947), Dwyer (1951), and Andree (1954 
® The permissible operations are as follows: 
1. Dividing all of the elements in a row by a constant. R row 
2. Adding or subtracting some fraction or multiple of the elements in one 
m the corresponding elements in another row of the matrix. 


$ a x all 
These operations may also be applied to the columns of matrices but we sh 
not do so here. 


fro 
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the second row. (Whatever arithmetical operation is performed 
on one value in a row must also be performed on the other values 
in that row.) Likewise, for the matrix at the right, 0.6 of each 
value in the first row is subtracted from the corresponding value 
in the second row. 


10 20 35 50 0 O 
O14 “16 -3.0 10 0 
3 8 3 D 0 1.0 


3. Similarly, in order to obtain a zero for the value in column 


1, row 3, we subtract .3 of each value in row l from the corre- 
sponding value in row 3 and perform a similar operation on the 
matrix at the right. For a matrix of more than 3 rows a similar 
process is continued until all the entries in the first column below 
the first row of the matrix on the left are zeros. It should be noted 
ns are performed on the rows, it is the 


that while the operatio 
atrix form one at a time. 


columns that are reduced to identity-m 


1.0 20 3.5 50 0 0 
0 -14 -1.6 -30 1.0 0 
O 2 S75 -15 0 1.0 


4. Divide each number in the second row of the matrix on 
the left by the value in the principal diagonal of the second row 
(—1.1), giving 1.0 in that cell. Divide each number in the second 
row of the matrix on the right by the same value (—1.1). 

5.00 .00 .00 


2.73 -.91 .00 
-1.50 .00 1.00 


1.00 200 3.50 

00 1.00 1.45 

00.20 -.75 

5. In order to change the other values of the second column 

to zeros we go through a process similar to that applied in column 
l. Subtract 2.0 times each value in the second row from the 
corresponding value in the first row. Subtract 2 of each value 
in the second row from the corresponding value in the third row. 
Do this for both matrices. If there are more than three rows, get 


the product of the second value in each row and each value in 
the second row and subtract it from the corresponding value in 
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the row being worked on. The first two columns of the matrix 
on the left have been reduced to identity form as indicated below. 


1.00 .00 .60 = .46 1.82 .00 
-00 1.00 1.45 273:= 91 00 
.00  .00 -1.04 -2.05 .18 1.00 


6. Divide each value in the third row of both matrices by the 
value in the principal diagonal of that row in the left-hand matrix 
(—1.04). Always start reducing a column by changing its value 
on the principal diagonal to 1.0. 


1.00 .00 .60 —.46 1.82 .00 
00 1.00 1.45 2737.91 00 
00 .00 1.00 1:97 = .17 =.96 


7. To change the values in the first two rows of the third 
column of the matrix on the left to zeros, subtract .6 of each 
value in the third row from the corresponding value in the first 
row, and 1.45 times each value in the third row from its corres 
sponding value in the second row. Perform the same operations 
on the rows of the matrix on the right. 


1.00 .00 .00 -1.64 1.92 .58 
.00 1.00 .00 = 13:— .66: 4.39 
00 .00 1.00 1.97 - .17-— .96 

I A? 


The matrix on the left has now been reduced to the identity 
matrix, and the matrix on the right should be A-t or the inverse 


of A. The results may be checked by determining whether A-A 
gives the identity matrix. In order to do this we must, of course, 


apply the rules for multiplying matrices. Multiplying the tw 
matrices we find that: 


2 a g -1.64 1.92 .58 ||" 1.00 .00 .00 
ut Bil « - 13 -.66 1.39 = .00 1.00 .0O1 
B A 38 1.97 -17 - .96 .00 .00 1.00 


A at I 
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The inverses of larger matrices can be found by the same 
routine operations.*° 


Matrix GEOMETRY 

Although it is not necessary to use graphical repre- 
sentation and geometrical concepts to work with factor-analysis 
problems, it is often helpful in order to visualize what is happening. 


Geometric Representation of a Correlation Coefficient. If 
each test or variable being correlated is represented by a vector 
(V),™ the correlation between two tests can be shown to be equal 
to the product of the length of the two vectors and the cosine of 
the angle ($) between them, as is shown in the following 


formula: 12 
å rig = ViVa cos $12 (3.1) 


of unit length, the correla- 


If each test is represented by a vector 
f the angle between them, 


tion coefficient is equal to the cosine o 


r =1.00 r =—.50 
(c) (d) 


r=.00 r =.707 


(a) (6) 


Ficure 3.3 Vectorial representation of correlation coefficients. 


as can be seen from the above formula. Since the cosine of an 
angle of 90° is 0.0, an r equal to zero can be represented by two 
unit-length vectors from a common origin at right angles (orthog- 
onal) to each other as in Figure 3.3a. The cosine of 45° is equal 
to .707 (approximately), so an 7 of .707 can be represented by two 
unit-length vectors with an angle of 45° between them as in 
Figure 3.3b. The cosine of 0° is 1.0, and an r of 1.0 can be rep- 
resented by two unit vectors with an angle of 0° between them or, 


heet for this method. : 
ine with a given starting po! 
s the scalar product. 


10 See p. 145 for a work sl 

11 A vector is a straight 1 

12 This product is referred to a 
90) for a proof of this equation. 


nt, length, and direction. 
See Thurstone (1947c, p. 
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in other words, by one unit vector collinear with the other as in 
Figure 3.3c. Any r between zero and +1.0 can be similarly repre 
sented by separating two unit vectors with the angle between 0 
and 90° whose cosine is equal to r. Negative correlations can be 
represented similarly, but the angle between the vectors is obtuse 
(between 90° and 180°). An r of —.50 can be represented by 
two unit vectors separated by 120°, since the cosine of 120° 1s 
—.50 as in Figure 3.3d. Any r between zero and —1.0 can be 
represented by two unit vectors separated by the angle between 
90° and 180° whose cosine is equal to r.!? 


Graphical Representation of a Table of Intercorrelations- 
The diagram in Figure 3.4 is an attempt to represent all of the 
intercorrelations in Table 3.7 geometrically 

in two dimensions. The correlations be 


1 
3 A 120° tween variables 1 and 2 and | and 3 are cor- 
E rectly represented. It will be observed, how- 
2 ever, that the angles between vectors 1 and 

to] 


2 and vectors 1 and 3 do not add up to 

FIGURE 3.4 Illustration 9 would 

to show that the correla- angle between vectors 2 and 3. It mS 

tion matak in Table 3.7 therefore be necessary to resort to a three 
cannot be represented in : x > r 

two pE bi ai dimensional figure in order to represent the 

three angles correctly. This may be Te- 

garded as a geometrical interpretation of the rank of a matrix- 


The rank of a matrix is indicated by the number of dimensions 


TABLE 3.7. CORRELATION MATRIX 


1.000 .000 .707 
-000 1.000 -.500 
-707 =.500 1.000 


required correctly to represent the vectors and their angular sep- 
arations. If they can be represented in one dimension (a straight 
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line), the rank of the matrix is one, and only one reference vector 
(factor) is needed to represent the relationships. This would be 
mie for four variables which satisfied Spearman’s tetrad-difference 
criterion. If the vectors and their angular separations can be 
represented in two dimensions (a plane), the rank of the matrix 
is two, and only two factors are needed to account for the inter- 
relationships. If the vectors and their relationships can be dia- 
gramed correctly only in three dimensions, the rank of the matrix, 
and hence the number of factors, would be three. Although we 
cannot visualize beyond three dimensions, the analogy holds. A 
matrix of rank n requires n dimensions to account for the inter- 
relationships and has n factors. 


TABLE 3.8. INTERCORRELATIONS AMONG FOUR 
VARIABLES 


(1.00) .80 .96 -60 
.80 (1.00) .60 .00 
.96 .60 (1.00) .80 
60 .00 .80 (1.00) 


wno- 


ntation of a Table of Correlations in Two 


Graphical Represe 
lled from plane geometry that two 
pP d 


Dimensions. It will be reca 
intersecting straight lines determine a 
plane. Any other line in the same 
plane can be expressed in terms of the 
first two lines and is said to be linearly 
dependent on them. The intercorre- 
lations of 4 variables shown in Table 
3.8 have been represented graphically 
in two dimensions in Figure 3.5. 
Since the relationships can be shown 
correctly in two dimensions, the rank 
of the 4 X 4 matrix is 2, and only 2  Fycure 3.5 Vectorial represen- 
independent vectors are needed to tation of —— ebeen! 
account for all the interrelatio 


nships. 
All four vectors can be expressed in terms of any two of the vec- 
tors. Vectors 2 and 4 are the most conv 


enient, since they are at 
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rs can 
right angles (orthogonal) to each other, and the four vecto 
be represented by the following equations: ** 


V, = 8V.+ 67, 
Va = 1.073 + OV, 
V3 = 6V + 8r, 
V, = OV. 41.07; 


This is the equivalent of a factor analysis of the four — 
in Table 3.8, although analyses usually are not done nega oe 
Also practical problems are not so simple as this one and are sec 
by other methods to be described later. It is seen that ee op 
ables can be accounted for in terms of two reference variab f a 
factors, anda consequent parsimony is effected. When an ane EN 
has been completed, resulting values are usually written in a 
breviated matrix form as in Table 3.9, The values in colum 


TABLE 3.9. FACTOR MATRIX 


ESES 


Variable 


A š may 
2 and 3 are referred to as factor loadings or saturations and 
i ; S. 
be used to interpret the hypothetical nature of the factor 


14 The equation: 
into component vector: 


two such component 
gonal case can be vi Foi 
hypotenuse of a right triangle is e each 


Vector 1 


1.0, 6 
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col 2 3 
aikaa a r oh E 
interpreted as that portion of tł i ae ps sae a art 
ea: ie P ie ariane of each variable which 
ome i 1 the other variables. (These values are also 
=a 1e principal diagonal of Table 3.8.) The length of a 
Erike 2 pt to h (the square root of the communality). 
aeia ctitious data presented above the communalities, and 
moc ia i vector, are all equal to 1.0. For actual data the 
eet ality rarely attains a value of 1.0 due to unreliability and 
k ee content of the variables. If the length of 

is less than 1.0, the correlation with other variables is 
Sien decreased since, it will be recalled, the correlation 
es a to the scalar product of the length of the two vectors and 

osine of their angular separation (formula 3.1). 

It may be observed from the example just presented that factor 
analysis is partly a method for determining (a) the number of 
dimensions needed to account for the interrelationships (correla- 
tions) of a set of variables, and (b) a set of arbitrary reference 
Vectors (factors) in terms of which the vectors representing the 


ji a can be expressed. 
„Reproducing the Table of Correlations from the Factor 
dings have been obtained, it is pos- 


Matrix. Once the factor loa 

sible to work back to obtain the correlations. The correlations 

may be computed by summing the cross-products of the rows of 
multiply the values in 


the factor matrix.’ Thus to get 72 (.80); 


tow 1 by the corresponding values in row 2 and summate (.6 X 0.0 
e cross-products for rows 


+.8 X 1.0). For rz, (-00), summate th 
2 and 4 (.0 x 1.0 + 1.0 X .0). In this way an entire matrix of 
Correlations can be “reproduced” from the factor loadings. 


véraphical Representation of 
Dimensions. The correlation CO 
represented graphically in three dimensions. 
in Table 3.10 that variable 3 has zero correlatio 


a Correlation Matrix in Three 


efficients in T ‘able 3.10 can be 
It will be observed 


n coefficients with 


matrix is equal to the product 


£ 15 In matrix notation R = FF’, or the correlations 
oË the orthogonal factor matrix and its transpose. In the above example we can 
are multiplying not by F’ but by its transpose. 


maid 
rp red rows by rows because We 
ce note 5 on p. 26.) 
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TABLE 3.10. INTERCORRELATIONS OF TEN VARIABLES 


10 
.70 
.70 
.50 
-36 J 86 
48.00.96 1.00 .80 .48 .36 60 82 
60 .00 .60 80 1.00 .00 .00 .00 .50 
48.60 64 .48 00 100 .96 .80 _.86 
64 80 48 36 .00 .96 1.00 .60 .82 
00.00.80 .60 ~.00 80 60 1.00 .70 
70.50 86 .82 .50_ .86 .82 70O 1.00 


9 


variables 6 and 9. Variables 6 and 9 are also correlated zero with 
each other. The vectors for these three tests are at right angles 


(b) (c) (d) 
FicuRe 3.6 Graphical representation of correlation coefficients in Table 3-10- 


to each other (cosine of 90° = 00). Unit-length vectors for the 
ten variables in Table 3.10 are shown in their “correct” positions 
in Figure 3.6a (i.e., all the unit length vectors are separated by 
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angles whose cosines are equal to the correlations between the 
variables). Vectors 1 and 2 are in the plane of vectors 3 and 6 
and can be expressed as a linear combination of 3 and 6. Vectors 
4 and 5 are in the plane formed by vectors 6 and 9 and can be 
expressed in terms of them. Similarly vectors 7 and 8 are in the 
plane formed by vectors 3 and 9. Vector 10 is not in any of these 
planes but is in the space bounded by vectors 3, 6, and 9 and can 
be expressed in terms of all three. If vectors 3, 6, and 9 are used 
as reference vectors (and called factors II, I, and III, respectively), 
we can read the loadings in the factor matrix in Table 3.11 from 


TABLE 3.11. FACTOR LOADINGS 


Factor 


Variable 


wORDOTDCOO®D 


ou Man OuKRW ND 


an be expressed as .8 of factor I (vector 6), 
.6 of factor II (vector 3), and .0 of factor III (vector 9). The 
loadings for the other variables can be read similarly from the 
diagram. The reader can, if he desires, verify that the correlation 
coefficients can be “reproduced” from the summation of cross- 
products (e.g., 12 = 6 X 0.8 +.8 X 6+ 0x 0S .96). Since 
the vectors representing the variables used in this example had 
unit length, the communality (°) of each variable is equal to 1.0. 

The loadings of the variables on the factors can also be shown 
graphically in one plane at a time. This makes possible the 
graphical representation of any number of dimensions, two di- 
mensions at a time. Figures 3.6), c, and d show the three-dimen- 
sional example in Figure 3-64 in such two-dimensional graphs with 
the lower-numbered factor placed on the horizontal axis. To get 


Figure 3.6a. Vector 1 ¢ 
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a complete graphing for n factors requires n(n — 1)/2 graphs. 
These graphs can be made up by considering the factor loadings 
in each row of Table 3.11 as the coordinates of a test vector and 
plotting two columns at a time. It is customary in factor analysis 


to represent a vector by just a dot placed at its head, it being 
understood that the vector extends from the origin (intersection 


of the reference vectors) to the dot. 


Rotation of Reference Axes. A perennial criticism of factor 
analysis has been that there is no unique location for the frame 


TABLE 3.12. FACTOR MATRIX 


Factor 


Variable 


of reference vectors. A given table of correlations has but a single 


configuration of vectors, but this configuration can be described 
with respect to an infinite 


0 number of locations of the 
frame of coordinate reference 
axes. Various locations of the 
reference frame can be ob- 
tained by rotating the refer- 
ence axes about the origin. 
The four variables whose 
factor loadings are shown in 
Table 3.12 are represented 
graphically on two orthogonal 
reference axes (I and II) in 


Figure 3.7. 
Ficure 3.7 Graphical representation of Any degree of rotation of 
Table 3.12 and rotation. 
the rotated reference axes (I 


and II) around the origin and in the plane could be used to repre- 
sent the configuration of test vectors and hence the table of inter- 


Mathematics Essential for Factor Analysis 39 


correlations. However, with each new location of the axes the 
projections or loadings of the tests on the factors would be dif- 
ferent. Axes may be rotated clockwise or counterclockwise any 
desired number of degrees without affecting the correlations 
among the variables. For a clockwise rotation of a pair of orthog- 
onal axes I and II, the rotated factor loadings for a variable f may 
be calculated from the equations 


a's, = a3, COS $ — Go Sing $ , 
¥ (clockwise rotation) 
a'j2 = ajı Sin @ + je COS $ 


where ¢ represents the angle through which the axes are rotated, 
a, represents the rotated loading of test j on factor I, a'j repre- 
sents the rotated loading of test j on factor II, aj: is the loading of 
test j on factor I prior to rotation, and aj is the loading of test j 
on factor II prior to rotation. 

For rotation of a pair of orthogonal references axes in the 
counterclockwise direction the formulas are 


a’j, = ajı cos $ + ajo sin $ : : 
d (counterclockwise rotation) 
a'j = —4j sing + aj. cos $ 
The rotation can also be written in matrix form. For clock- 
wise rotation the matrix, known as the transformation matrix 


and represented by the symbol A, is 


2 orthogonal axes rotated 
clockwise through an angle 
of ọ degrees) 


| cos@ sing | (transformation matrix for 


—sing cos¢ 


and for counterclockwise rotation 


cos@ — sing (transformation matrix for 
sind cos é 2 orthogonal axes rotated 
counterclockwise through 


an angle of ¢ degrees) 


The matrix multiplication of the factor matrix by the trans- 
formation matrix yields the rotated factor matrix: 
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Factor Factor 
F-AN =F, 
cos@ —sing 
sin } cos } 


in which a’, = a1 cos $+ G2 sin ¢, @’y2 = di (— sin ¢) + 42 
cos ¢, etc. 

If factors I and II are rotated clockwise 50°, the rotated factor 
loadings can be read either directly from the graph or computed 
from the equations. The sine of 50° = .766, the cosine of 50° 
= .643. 

The rotated loading of test 1 on factor I, is 


(.6)(.643) — (.4)(.766) = (.3858) — (.3064) = .0794 
The rotated loading of test 1 on factor II, is 
(.6)(.766) + (.4)(.643) = (.2572) + (.4596) = .7168 


The loadings of the other tests have been similarly computed 
in Table 3.13. 


TABLE 3.13. COMPUTATION OF FACTOR LOADINGS FOR 
A CLOCKWISE ROTATION OF 50° 


The rotated loadings to two decimal places are reported in 
Table 3.14. The communalities computed from the rotated load- 
ings are a partial check on the accuracy of the calculations. Com- 
munalities should remain invariant within the margin of round- 
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TABLE 3.14. ROTATED FACTOR LOADINGS 


ing error regardless of the amount of rotation or the number of 
factors.*® 

The rotated loadings can be obtained similarly either graph- 
ically or analytically for any degree of rotation of a pair of orthog- 
onal axes clockwise or counterclockwise. 

Considerations in determining the amount and direction of 


rotation will be presented in Chapters 7 and 8. 


Summary of Vector Representation of the Correlation Matrix. 
The following summary of geometrical concepts which should 
serve as a useful review at this point has been adapted from Thur- 
stone (1947c, pp. 98-100). 

1. A frame of reference axes must be inserted into the test 
configuration before any factor matrix can be written. 

2. Since the coordinate axes are not determined by the experi- 
mentally obtained correlation matrix, every factor matrix is in the 
nature of a factorial interpretation of the given correlations. 


mmunality (h2) remains invariant under rotation may 


16 The rea shy the co! 
ese aia" panying diagram. From the Pythagorean 


be clearer from considering the accom 
theorem of geometry: 


2 


2 2 za 
aj? + an? = h? and d'p Ha’ =E hi 
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3. Each row of the orthogonal factor matrix can be interpreted 
as showing the rectangular components of a test vector. 

4. Each column of a factor matrix identifies a factorial axis in 
the structure. 

5. Each reference factor is represented by a unit-length refer- 
ence vector (which can extend from +1.0 to —1.0). 

6. The graphs on which are plotted pairs of columns of the 
factor matrix show the factorial structure which combines the 
test vectors and the reference frame. 

7. If the entries in an orthogonal factor matrix are all positive 
or zero, all test vectors are in the first quadrant (or octant, etc.) 
and the structure has a positive manifold. 

8. The correlation matrix can be interpreted as showing the 
scalar products of all pairs of test vectors, 

9. The length of each test vector is equal to the positive square 
root of its communality. 

10. The correlation matrix with known diagonal entries deter- 
mines a unique configuration of test vectors that can be repre- 
sented as a physical model if the rank of the matrix is less than 4. 

11. The correlation matrix does not determine any reference 
frame. 

12. No information is lost in representing the correlation 
matrix as a configuration or vice versa. 

13. A positive correlation coefficient represents a pair of test 
vectors with acute angular separation. A negative correlation 
represents a pair of test vectors with obtuse angular separation. 

14. A correlation matrix with coefficients that are all positive 
or zero determines a unique configuration of test vectors in which 
there are no obtuse angular separtions, 

15. The signs of all the entries in a column of the factor matrix 
may be reversed without altering the corresponding correlation 
matrix. 

16. Reversing the signs in a column of the factor matrix re- 
verses the direction of Ge, reflects) the corresponding reference 
vector and reference factor. 

17. If all of the signs are reversed in a row of the factor matrix, 


then all of the signs are reversed in the corresponding row and 
column of the correlation matrix. 
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18. To reverse the signs in a row of the factor matrix reverses 
the direction of the corresponding test vector. 

19. The self-correlation of a test remains positive for all pos- 
sible reversals of sign of tests and factors. 


PROBLEMS 


1. Post-multiply matrix A by matrix B and label the product 
matrix C. 


6 9 3 4 62) fl 

e sh 32 A $ t 

1 E 8.9 6 
A B 


2. Find the inverse of matrix A. 

3. Check the calculation of the inverse matrix by the multiplication 
A'A- =L 

4. Check the multiplication in Problem 1, above, by the multiplica- 
tion B = A~3C, 

5. Post-multiply matrix B by matrix A and compare the results 
with the multiplication AB that you performed in Problem I 

6. Rotate factors I and II in Table 3.12 counterclockwise 50°. 


CHAPTER 4 


Basic Assumptions 
of Factor Analysis 


First ASSUMPTION 


A BASIC assumption of factor analysis is that 
a battery of intercorrelated variables has common factors running 
through it and that the scores of an individual can be represented 
more economically in terms of these reference factors. An indi- 
vidual’s score on a test is dependent upon two things, the par- 
ticular abilities assessed by the test and the amount of each of 
these abilities possessed by the examinee. This can be repre- 
sented for r independent * factors by the equation: 


Sji = Aii F joXoe e+ + aj px ys (4.1) 
where sj represents the standard score of individual i on test j 
aj, the factor loading of test j on factor r, and x»; the amount, in 
standard scores of the uncorrelated trait measured by factor 7 
which is possessed by individual i. 


It is to be hoped, in the interests of parsimony, that the refer- 
ence traits are fewer in number than the tests. 


1The formula is more complex for correlated or “oblique” factors. For an 
advanced discussion of whether equations of the type of equation 4.1 can result 
in a unique set of loadings or weights, see Kestelman (1952). 
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It is of interest to note that even if all human behavior could 
be accounted for by a limited number of reference abilities, there 
would still be sufficient combinations of terms in equation 4.1 to 
insure each person in the world a unique pattern of abilities. 

Types of Variance. The total variance * of a variable 7 may 
be subdivided into three general types—common, specific, and 
error variance. The common and specific variance combined 
make up the reliable variance and their combined value is usually 
indicated by the reliability coefficient. 

Definition: Common Variance is that portion of the reliable 
variance which correlates with other variables. 

Definition: Specific Variance is that portion of the reliable 
variance which does not correlate with any other variable. 

Definition: Error Variance is the chance variance due to errors 
of sampling, measurement, unstandardized conditions of testing, 
physiological and other changes within the individual, and the 
host of other influences which may contribute to unreliability. 
It is assumed to be uncorrelated with the reliable variance. 

One function of factor analysis is to analyze the common vari- 
ance to determine the number and types of common variances 
which result in the correlations between variables. The total 
variance of a variable j may be represented by the following 


equation: ë 


Peet ai + oie oj? (4.2 
oj? = op? + oj? +t Gir + aj? + oye? (4.2) 
common variances specific error 


variance variance 


Dividing both sides of equation 4.2 by oj. 


? oj? | Tia oir? | Tj | Tie” (4.3) 
j j1 j f 
T =10= 47 ae a T 
o;” oj oF j j 


in standard-score form with the total variance 


0 and each type of variance expressed as a 
For convenience we will repre- 


The equation is now 
of variable j equal to I. 
proportion of the total variance. 


i variable, dis- 

2 Variance (e?) is an index of the extent to which a test, oF other 
criminates individual differences. l 
3 There are assumed to be r contributors to common V 


r stands for the rank of the correlation matrix. 


ariances where the symbol 


46 Introduction to Factor Analysis 


sent each type of variance by a letter symbol and equation 4.3 
may be rewritten as follows: 


1.0 = ayy? + aj? + GP + sF + a Z (4.4) 


-$ 


common variances specific error 
variance variance 


The values of the square roots of the common variances (aj 
through aj) are referred to as the factor loadings or saturations 
and (for independent factors) represent the amount of correlation 


Reliable Variance 


— ~v IN — "Ta am] 
Common Variance Specific Error 
(A,?) Variance Variance 


Ficure 4.1 Diagrammatic representation of the total variance of a variable sub- 
divided into types of variance (in standard scores). 


of test j with each factor. The relationship may also be repre- 
sented diagrammatically as in Figure 4.1. This represents the 
additive assumption of factor analysis that the total variance of 
a test is the sum of its component variances. The additive com- 
bination is a first approximation, appropriate to the present stage 
of accuracy of psychological measurements. Later refinements 
may require ratios, products, or more complex types of combina- 
tions of factors. 

It might be pointed out here that for purposes of prediction 
and the scientific identification of fundamental traits we are in- 
terested primarily in the common variances, since a variable will 
predict only to the extent to which it correlates with some cri- 
terion or other variable. The specific variance, although it may 
be reliably measured, is of little scientific interest since it is not 
related to anything but itself. 


From equation 4.4 we can deduce some further relations and 
definitions. 


* Specific variance, however, may serve to indicate areas which need further 
investigation, since it is potentially common-factor variance. 
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Definition: The communality of variable j is the sum of its 
independent common variances and is represented by the symbol 
hj. 

an? Haj? +++ ay? = h (4.5) 


The total variance of variable j may now be represented by 
hj? + 5? + e? = 1.0 (4.6) 


Since, as has been previously noted, the reliable variance of 
a test is the sum of its common and specific variances, it may be 
represented by the equation * 


ry = hp a a ef? (4.7) 


The communality of a variable is less than its reliability except 
for the limiting case in which the specific variance is equal to 
zero. Other relationships derivable from equation 4.7 which it 


may be of interest to observe are as follows: 


hj? = 143 — 57? (4.8) 
sP = ry AP (4.9) 


Definition: The uniqueness of variable j is that portion of the 
total variance which it does not have in common with any other 


variable and is represented by the symbol u?. 


wp = sf te =l- i? (4.10) 


From equations 4.6 and 4.7 we also have 


e? = = (h? + s’) =1— Tjj (4.11) 


SECOND ASSUMPTION 
A second assumption of factor analysis is that the 


correlation between two variables j and k can be accounted for by 


fused with 7jx, the correlation 


2 iability coefficient (not to be con 
Tjj, the reliability ( ean 


coefficient) represents a proportion of variance and does not ne 
as do the other symbols in the formulas. 
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the nature and extent of their common factor loadings. For or- 
thogonal factors this can be represented by the equation: ° 


Tjik = Ajari + Ajodng +++ + AjrAkr (4.12) 
This is the summation of cross-products referred to in Chapter 
3, page 35, in reproducing correlation coefficients. 

A number of interesting relationships among tests can be 
illustrated by the application of equation 4.12. Let us assume 
a matrix of factor loadings for three tests and a criterion on four 
factors as shown in Table 4.1. This can also be represented 


TABLE 4.1 


schematically as in Figure 4.2, where the component variances are 
shown. 


The correlation between tests 1 and 2 can be computed as 
follows: 


ria = (.7)(.0) + (.0)(.6) + (.4)(.0) + (.3)(.2) = .06 


The correlation between tests 1 and 2 is seen to be due to their 
common loadings on factor IV alone. 


The correlation between tests 1 and 3 is equal to 


ris = (-7)(8) + (.0)(1) + (.4)(.5) + (.3)(.0) = .86 


The correlation between these two tests is due, in large part, to 


their common loadings on factor I and to a lesser extent to their 
loadings on factor III. 


6 This equation is derived from the basic 


equation of factor analysis for inde- 
pendent (orthogonal) factors, which in terms 


of matrix notation is 
R= FF (4.13) 


The equation indicates that the product of the factor matrix and its transpose is 


equal to the correlation matrix. 
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49 46 .09 12 14 
a ria, as sf e? Test1 
36 .04.10 .50 
| O29" je eg 2] Test 2 
aza 
.64 Ol .25 .05.05 
Oos [ee 
a: Sf ef 


j 40 .20 


09 .25 .36 
el a af El e] Criterion 


Ficure 4.2 Digrammatic representation of the variances of the tests and criterion 
shown in Table 4.1. 


The correlation between test 1 and the criterion is 


ria = (.7)(.3) + (.0)(-5) + (4)(.6) + (.3)(.0) = .45 


This commonly is referred to as the validity coefficient for test 1. 
It will be observed that the validity of test 1 for this criterion is 
due to its having variance on factors I and III in common with 


the criterion variable. 
The validity coefficients for tests 2 and 3 are 


= (.0)(.3) + (.6)(.5) + (.0)(.6) + (.2)(.0) = .30 
= (.8)(.3) + (1)G5) + 65)66) + (.0)(.0) = .59 


is due entirely to factor II, whereas the 
function of its complexity, since it 


rec 
r30 


The validity of test 2 
validity of test 3 seems to be a 
measures three of the four factors. 

Similarly, the other correlation 


be computed from Table 4.1. 
From the equations for reproducing the validity coefficients of 


the tests some of the reasons why tests are valid, and what must 


be done to increase the validity of a test or a battery of tests, be- 
come apparent. This criterion is weighted in factors A, B, and C. 
Tests which are heavily weighted in these factors will have maxi- 
mum validity for this criterion. To insure maximum validity for 
1 factors present in the criterion should 


and validity coefficients could 


predicting a criterion al 
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be represented in the test by a loading of like sign. The higher 
the loading, the higher will be the validity. If a criterion has 
considerable specific variance, tests should be sought which cor- 
relate with the specific variance and consequently transform it 
into common-factor variance.’ 


PROBLEMS 


Given the following data: 


Variable 
Test 1 
Test 2 
Test 3 
Criterion 


Compute the h? of test 1. 

Which test has the largest specific variance? 

Compute the uniqueness of test 2. 

Compute the table of correlations for the four variables. 
Which test is most valid for the criterion? 

If test 1 were to be altered to increase its validity for the cri- 
terion, on which factor would it be most important to increase its 


loading, and on which factor might it be most desirable to reduce its 
loading? 


By St i be to te 


T See Guilford (1948b). 


Oe 


CHAPTER Ô 


Diagonal and Centroid 
Methods of Factoring 


Te FACTOR problem usually starts with a table 
of intercorrelations which is to be analyzed into factors. The prin- 
cipal diagonal of the correlation matrix, which should contain the 
correlation of a test with itself, is usually vacant. The various 
purposes of factoring call for different values to be inserted in 
the principal diagonal. Most commonly inserted is one of the 


following three values: 


(hj?) of variable j, which is that propor- 


l. The communality 
ariable which is correlated with 


tion of the total variance of a v 


other variables. 
2. The reliability (ry) of variable j, which is that proportion 


of the total variance of variable j, which can be correlated with 
itself. 
3. Unity (1.0) which represents the self-correlation of variable 
j due to all of its variance including common, specific, and error 


variances. 
ying the nature of the funda- 


For the scientific problem of stud 
the communalities are 


mental factors resulting in correlations, 
51 
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usually inserted in the principal diagonal, since they will minimize 
the rank of the matrix, and the factors extracted will best repro- 
duce the correlations. If it is desired to obtain factors which best 
account for that portion of the individual scores that is due to 
reliable variance, the reliabilities should be inserted in the prin- 
cipal diagonal since specific as well as common factors can then 
be obtained. 

If it is desired to obtain factors which best reproduce the actual 
original individual scores with which the factor problem was 
started, unity should be inserted in the principal diagonal cells 
since the factors extracted will account for the error variance as 
well as the common and specific variances. 

The communalities are inserted in the principal diagonal in 
most studies, and, since they usually are not known in advance, 
they must be estimated. In those relatively rare cases where they 
are known in advance or can be very accurately estimated, the 
diagonal method of factoring can be applied. The diagonal 


method is also used as one of the steps in the multiple-group 
method to be presented in the next chapter. 


Tue DIAGONAL METHOD 


The diagonal method of extracting factors from a 
correlation matrix is one of the most simple and direct. Its use- 
fulness in most problems, however, is limited by the fact that it 
requires very accurate estimates of the communalities. The 
method can be applied to tables of correlations of any size. 


Numerical Example. By way of illustration, the correlation 
matrix in Table 5.1 will be factored by the diagonal method. 
The communalities of the variables in this fictitious problem are 


TABLE 5.1. CORRELATION MATRIX WITH KNOWN COM- 
MUNALITIES INSERTED IN THE PRINCIPAL DIAGONAL 


Variable 


(52) 48 -36 -40 : 
-48 (64) .00 -16 #2 
-36 -00 (.81) 63 09 

63 (.53) .25 

225 
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known in advance and have been inserted in the principal 
diagonal. 

The general form of the table of factor loadings is shown in 
literal form in Table 5.2. It will be observed that all the entries 


TABLE 5.2. GENERAL FORM OF THE FACTOR MATRIX 
OBTAINED BY THE DIAGONAL METHOD 


Variable 


to the right of the principal diagonal are zeros. The number of 
factors is unknown at this point in the analysis. Room has been 
left for as many factors as there are variables, although, it is to be 
hoped that fewer factors than tests will be needed. The value 
of an? is equal to h,?, since it will 


be recalled that the communality Vector ans 
of a test variable is the sum of the A 
squares of its common factor load- Axis I PEPA 


ings and the loadings for variable 
l on factors other than I are 
zeros. Hence, 


au = Vie = V52 = .7211 (5.1) 


Geometrically, this is the equiva- 

lent of passing factor I through ficure 5.1 Mlustration of the loca- 
ing l on tion of the first diagonal-method 

test 1. The loading of test axis collinear with test-vector 1, and 


other factors will then be zeros if the second axis orthogonal to the 
first and in the plane determined by 
the other factors are placed orthog- Retest and 2. y 


onal to factor I. 
The loading of the other variables on factor I may then be 


obtained if it is recalled that the correlation between two variables 
is equal to the cross-products of the common- -factor loadings. 
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Thus 
riz = 411821 + 412422 + 413423 + 414424 + 415425 


Since each product after the first involves a zero, the formula 
reduces to 


712 


r12 = 411421 OF 421 = 
ay 


Substituting the known values for fıs and ay; 


a a 
"o Gn Feld 
Similarly, 
T13 36 
43, = — = —— = .4992 
“u an -ØE P 
T14 40 
= = 2 = 1554 
‘~~. a ee 
ris .58 
451 = a = 11 = .8043 


These values have been placed in the first column of the factor 
matrix in Table 5.3. The general formula for obtaining the 


TABLE 5.3, FACTOR LOADINGS DERIVED BY THE DIAG- 
ONAL METHOD OF FACTORING 


Factor 


loading for the jth variable of the first column after the value for 
4; has been determined is 


jg (5.2) 
a1 
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It will be observed in Table 5.2 that all the common-factor 
variance of test 2 is accounted for on the first two factors, or 


E22? + a22? = he? 


This is equivalent, geometrically, to putting the second factor 
orthogonal to the first in the plane determined by the first two 
test vectors. The loading of test 2 on all factors orthogonal to 
the first two will then be zero. Since the values of azı and h are 
known or have been determined, we can solve for ass. 


ass = Vha — azı? = V64 — (.6656)? = .4438 (5.3) 


By using the formula for reproducing the correlation coeffi- 
cients from the cross-products, the remaining loadings on factor 
II may be obtained. 


t23 = 421431 + 422433 + 423433 + 424434 + 425435 


Since each product after the second involves a zero, the formula 
reduces to 


723 — 421431 
r23 = 421431 + 422432 and as = = eo a 
22 


Substituting the known values, 


ro — 21031 _ .00 — (.6656)(.4992) _ 
eam ee 4438 7483 


Similarly, 
_ fa — lam .16 — (.6656)(.5547) aa 


442 


ii 4438 
r25 — Go1@s1 -72 — (.6656)(.8043) 
z = = .4161 
Sod = 4438 


The general formula for obtaining the loading of test j on the 
second factor after loading a22 and the loadings on the first factor 
have been determined is 
foj — @214; 
aj = eet eee (5.4) 


422 
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The values for the second factor have been entered in column II 
of Table 5.3. nia 
When starting on a new factor, the first step is always to fin 
the loading in the principal diagonal cell for that column. For 
factor III , 
31° + a3? + a33? = ha? 5 
and (5.5) 
433 = V h? — (a31? + a2”) 


Substituting the known values 


433 = V.81 — (4992? + .74892) = .0072 


This value is within rounding error of .00. Each successive test 
should, in turn, be interchanged with test 3. If loading ass 18 
within rounding error of zero, regardless of which test is put third, 
there are only two common factors and the analysis is completed. 
If loading aj is zero regardless of which test is placed in the jth 
position, there are j — 1 factors. In our example, ass is within 
rounding error of zero regardless of which of the remaining tests 
is placed third. The rank of the correlation matrix is therefore 
two, and only two factors are needed to account for the correlation. 

If the value of az; had been significantly greater than zero, the 
remaining values in the column would have been obtained by 
means of the formula for reproducing correlation coefficients, 
just as they were in columns 1 and 2 of Table 5.3. The general 
formula for loading ax; (i.e., the loading of test k on factor j) when 


it is a loading to the left of the principal diagonal can be obtained 
from the following formulas: 


Tjk = Ayer F Ajzak +++ ayjany ++ ayer 
All products after the jth contain a zero and vanish and the 


formula reduces to 


Tik = Ajiki F ajapo +++ ++ aj; arji + lja 
and 
ak (ajiak + GjoQng ++ +++ ajj—18kj—1) 


Ajj 


akj 


(5.6) 
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The general formula for finding loading aj (i.e., the loading of 
test j in principal diagonal cell of factor J) is obtained from the 
formulas 


2 2 aia? a? +47 = 
aj? aj2 j3 ni j, j—1 ji h; 
and 


aj = Vhp — (ap? + a2? Fas? Fe F352) 67) 


Summary of Routine for Diagonal Method. 1. Obtain the 
loading of test 1 on factor I by the formula 


an = Vh? (5.1) 


2. Fill in zeros for the loadings of test 1 on the remaining 
factors. 

3. Obtain the loadings of the other tests on factor I from the 
formulas 


r12 
ay = 
411 
713 
Ts eS 
41 
fij 
aj = — (5.2) 
411 


4. Obtain the loading of test 2 on factor II from the formula 


og = Vig? — Gay? (5.3) 


5. Fill in zeros for the loadings of test 2 on the remaining 


factors. 
6. Obtain the loadings of the other tests on factor II from 


the formulas 
723 — 421431 
a E 
422 
T24 — 421441 
Be SS 
422 
Voz — 4214j1 
aj = ———_ (5.4) 


422 
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7. For each column first obtain the value in the principal 
diagonal cell. For test j use the formula 


ajj = Vh? — (an? + aj? +++ + 5, 5-1) (5.7) 
If the value of ay is within rounding-error of zero, try each 
subsequent test in place of test j. If they all give esentially zero 
values for aj, there are 7 — 1 factors. If any gives a value greater 
than zero, interchange it with variable j and obtain the remaining 
loadings on factor J in the usual manner. 
8. The loading of test k on factor J may be obtained from the 
general formula 
rik — (jäki + ajar ++ +++ Aj, 514k, 5-1) 


akj = (5.6) 


fji 


9. The accuracy of the computations may be checked by com- 

puting the communalities and intercorrelations from the factor 
loadings. They should agree with the 
original values within the margin of 
rounding error. 

, 10. The factor loadings of each vari- 
able may be considered its coordinates 
on a system of orthogonal reference axes 

ı and the configuration of test vectors 
plotted. The factor loadings in Table 
5.3 have been plotted in Figure 5.2. 
Since there are but two factors, only one 
2-dimensional graph is needed to repre- 
sent them. The reference axes may be 
rotated any number of degrees in the 

Figure 5.2 Clockwise rota. Plane. A clockwise rotation of 56° 19 

tion ne diagonal method, Y would yield a convenient set of refer- 

ence axes since all factor loadings would 


be positive and in a single quadrant. The rotated factor loadings 
are given in Table 5.4. 


> Moa LO WoO, 


The diagonal method of factoring is theoretically correct and 
can be used with any size correlational matrix. It is very sensitive 
to the values of the communalities, however, and should be used 
only when very accurate estimates of them are available. 
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TABLE 5.4. ROTATED VALUES OF LOADINGS DERIVED 
BY THE DIAGONAL METHOD 


Factor 


Variable 


THE CENTROID METHOD 

The method of factoring the correlational matrix 
which has probably been used the most frequently with experi- 
mental data is the complete centroid method developed by Thur- 
stone (1947c, Chap. 8). Although it does not give a mathemat- 
ically unique, least-squares solution, as does the principal-axes 
method to be described later (see p. 99), it is based on summa- 
tions and is computationally less laborious. 

In order to represent geometrically the steps involved in the 
extraction of centroid factors, let us consider a simple, two-dimen- 
sional example. The configuration of the five tests, whose inter- 
correlations are shown in Table 5.5, is represented in two dimen- 
sions in Figure 5.3. For the two-dimensional case, the first 
centroid axis is passed through the center (or centroid) of the 
configuration of test vectors in such a manner that the sum of 
the positive projections of the test vectors on the second centroid 
axis (which is in the plane and is orthogonal to the first centroid 
axis) are equal to the sum of the negative projections on that axis.* 
After the projections (or loadings) on the first centroid axis have 
been noted, the residual correlations among the tests in the second 
factor (and in the case of r factors in the r — 1 remaining dimen- 
sions) are obtained. In order to extract the loadings on the second 
centroid factor, the direction of some of the test vectors are re- 


null vector as the resultant for the configuration of residual test 
maining after the first centroid axis has been removed. 
rrelation coefficients in the remaining dimensions is 
ple by obtaining the algebraic sum of the 


1 This gives a 
vectors in the dimensions re! 
The sum of the residual co 
zero, This could be verified for the exam: 
projections on axis II in Figure 5.3. 


60 Introduction to Factor Analysis 


versed (reflected). The goal of this process of reflection is to make 
the resultant of the test vectors on the second factor (and remain- 
ing factors if more than two are involved) as large as possible. 
This will give a centroid point as distant from the origin as is 


Tic. 5.3 Location of the first and second centroid factors and residual vectors for 4 
configuration of five tests in two dimensions. 


possible and permit the extraction of a maximum amount of 
variance with the second and succeeding centroid factors. For 
the example in Figure 5.3 this could be accomplished by reflecting 
the residual vectors for variables 4 and 5 on axis II. This would 
result in all of the residual vectors on axis II going in the same 
direction and would maximize their resultant okie The com- 
putation of centroid loadings is a relatively straightforward mathe- 
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matical procedure and does not necessarily require the under- 
standing of the above geometrical considerations. 


TABLE 5.5. CORRELATION MATRIX AND COMPUTATIONS 
FOR THE FIRST CENTROID LOADINGS 


2 

3 (71) .49 

4 (.65) 1.86 

5 .65 (.65) | 1.94 

Ej 1.86 1.94 IZ =9.80 

ta 2.51 2.59 | 7 =13.06 

a .717 | VT, = 3.613862 
1 
-L = .276712 
VA - 


Zaj4 = 3.615 (Check) 


Computation of the First Centroid Loadings. The correla- 


tion matrix in Table 5.5 will be used to illustrate the steps in the 


centroid method. 


1. If, as is usually the case with experimental data, the com- 
munalities of the tests are not known, they may be estimated from 
the data at hand. One simple method of estimating the com- 
munality of a test is to guess it to be equal to the highest correla- 
tion of that test with any other variable in the correlation table.* 
The highest correlation in each column has been inserted in the 
principal diagonal cell of Table 5.5, in parentheses. This value 
is a first estimate of the communality of the test and is always posi- 
tive regardless of the sign of the highest correlation coefficient 
in the column. 

2. Add each column algebraically, omitting the diagonal 
values, and place the totals in the first row, below the matrix, 


labeled “3j.” 


this method of estimating the communality is that the 


be estimated by its projection on the vector closest to it. 
g communalities will be referred to below. 


2 The justification for 
length of a test vector can 
Other methods for estimatin, 
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3. Add each row algebraically, omitting the diagonal values, 
and place the totals in the first column to the right of bie matrix, 
labeled “Xj.” The corresponding row and the column X’s should 
check. The importance of using checks for every step in the 
computations can hardly be overemphasized. They will save 
much time and effort in the long run. ; 

4. Usually the column totals of a table of intercorrelations are 
all positive. However, if any of the column totals are negative, 
it is necessary to go through a process of reflection at this porat 
In the example given in Table 5.5, all column totals are positive, 
and so reflection at this point is unnecessary. The reflection 
process will be explained in connection with the first table of 
residuals, Table 5.6, in which the column totals are negative. 

5. Add the estimate of the communality of each variable to 
the column total of that variable. This row of totals is labeled 
ty (see Table 55): ; 

6. Add the sums across in row tn. The sum of row tj gives 
the value T, which is the total sum of all of the values in the table. 

7. Compute VT; and 1/Y Ti 


8. Multiply each entry in row ta by 1/VTy. The resulting 
values are the factor loadings on centroid factor I (aj) and are 


placed in the row directly below row tj. 


431 = by (=) (5.8) 


9. A partial check on the computations is that Yay = V Ta. 
10. Place the factor loadings vertically in the first column of 
a factor matrix. (See Table 5.8.) 


11. For the first centroid factor, the loadings of variables which 


have been reflected are negative, and the loadings of variables 
which have not been reflected are positive. In the present exam- 
ple none of the variables have been reflected, and all of the load- 
ings on the first factor are positive. 


They have been so entered 
in Table 5.8. 


Computation of the Residual Correlation Matrix. 1. Prepare 


a tabular matrix form and number the columns and rows. (See 
Table 5.6.) 
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2. Place each of the first-factor loadings, without sign, above 
the corresponding column number for each variable. Also place 
the factor loadings in the cells to the left of the corresponding 
row numbers. (See Table 5.6.) When using factor loadings to 
compute residuals, all factor loadings are considered to be posi- 
tive, regardless of their sign in the factor matrix. 

3. Residual correlations are computed by subtracting the 
product of the factor loading of the column and the factor load- 
ing of the row, from the value in the corresponding cell of the 
preceding correlation or residual-correlation matrix. The result 
is placed in the corresponding cell of the new residual correlation 
matrix. The formula is 


1Pjk = Tjk — 41° iki (5.9) 


where ipy, is the first residual-correlation between variables j 
and k. 


Example: To find the residual in cells; (Table 5.6), subtract from 
the value in cell,; of the preceding matrix (Table 5.5) the product of 
the factor loadings of row 3 and column I 


1p31 = -54 — (.816)(.634) = 023 


4. Diagonal residual values are equal to the preceding esti- 


mated diagonal values minus the square of the factor loading of 


that variable. 
5. The algebraic sum of each column (or row) of the new 


residual correlation matrix, Xo, should be equal to zero (within the 
margin of error due to rounding). This serves as a partial check 


on the accuracy of the computations. : 
6. From this point, the process of extracting loadings for the 


next factor is the same as the method described for the correla- 
tion matrix except that it is necessary to reflect some of the vari- 
ables and to re-estimate the communality for each test in every 
residual correlation matrix. The communalities may be re-esti- 
mated by inserting the highest residual correlation of each column, 
regardless of sign, in the diagonal cell of that column immediately 
above the residual communality. The estimated communality 
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TABLE 5.6. FIRST RESIDUAL-CORRELATION MATRIX 


¥ 
(.717) 


* 
(634)  (.753) (816)  (.695) 
4 


(634) -.002 
(753) —.001 
(.816) .001 
*(.695) -.002 
*(.717) d 3 3 —.002 


(Check) 


LL j2=7.634 
Column 4 .042 
5 232 382’ 1.810 

515+ Tz = 2.454 


-329 


has a positive sign regardless of the sign of the correlation coeffi- 
cient from which it is estimated. These re-estimated values will 
not be used, however, until step lla in the next section. 


Reflection. If any of the column sums (omitting diagonal 
values) 3; of the table of residuals are negative, it is necessary to 
reflect certain of the rows and corresponding columns of the table 
of residuals. This will be the case for all residual tables of 
centroid factors. The goal of the process of reflection is to make 
the total algebraic sum of all the values in the table as high posi- 


tive as is possible. This may be accomplished by the following 
steps: 


1. Sum the column totals (32) and place their total (3342) at 
the right of the row of totals. (See Table 5.6.) 

2. Select the column with the largest negative total (column 
4 in Table 5.6). Copy the column total in the row immediately 
below, with the sign changed to positive. Label this row with 
the number of the variable being reflected. 
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3. Place an asterisk at the head of the reflected column and 
also in front of the number of the corresponding row, to indicate 


that the variable is reflected. 

4. In each column, other than the one being reflected, double 
the value of the residual in the row being reflected, change its 
sign, and add that amount to the column total. Enter the new 
sum in the corresponding cell of the next row labeled column 4. 


Example: To get the value in the first column for the row labeled 
“Col. 4,” double the value in the fourth row, first column (—.11)), 
change its sign and add to the total for column 1. 


—.140 + 2(.111) = .082 
5. After all the values in the new row have been obtained in 
they should be totaled. If the computations have been 
correct, the row total should be equal to the total of the previous 
row plus four times the total of the column which has just been 
reflected. This check should be satisfied for each row before 


proceeding to the next reflection. 


Example: —.634 + 4(.169) = .042. 


6. If any of the new column totals are negative, select the 
e total as the next column to be 


this way, 


column with the highest negativ 


reflected. F 
7. Repeat the process in steps 1 to 4 above, using the adjusted 


column totals in the previous row in place of the original column 
totals. However, in columns which have been previously re- 
flected once, the signs are not reversed before adding the doubled 
value (step 4). Lee 

Sometimes in carrying out the process of reflection it is neces- 
sary to reflect a given column (and corresponding row) more than 
once in the same table. For the first, third (or any odd) reflection, 
the sign of the doubled value should be changed before adding 
it to the adjusted column total as suggested in step 4. For the 
second (or any even-numbered) reflection, the sign of the doubled 
value is retained for the addition. 

8. In order that previously reflected columns may be located 
easily, it is advisable to underline successive totals of reflected 


columns. (See Table 5.6.) 
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9. Continue this process of reflection until all column totals 
are zero or positive, applying for each row the check mentioned 
in step 5.° 

10. Change the signs of the values in the correlation or residual 
matrix as follows: 

a. Reverse the signs of all reflected row values not in reflected 

columns. 

b. Reverse the signs of all reflected column values not in re- 
flected rows. 

11. The factor loadings for the next factor are then obtained 

by the steps mentioned previously and reviewed below: 

a. To the last row of adjusted column sums add the re-esti- 
mated communality for each column. These sums are the 
values in row tj. 

b. The sum of the values in row tio is Xtjo or To. 

c. The factor loading for variable j on the second factor is 


aii (z) (5.10) 


12. Place the loadings in the appropriate column of the matrix 
of centroid factor loadings (Table 5.8). The second centroid 
factor is placed in column II, etc. 


13. The signs of the centroid loadings are determined as 

follows: 

a. The sign of a variable that has been reflected once (or one 
reflected an odd number of times) is the opposite of its sign 
on the preceding factor. 

b. The sign of a variable which has not been reflected (or one 
reflected an even number of times) is the same as its sign 
on the preceding factor. For example, if there are four 
centroid factors, a variable which has been reflected once 


in the first and second tables of residuals would have the 
following pattern of signs: 


8 Since the order of selecting variables for reflection is arbitrary, occasionally 
the table total can be made higher by continuing the process of reflection beyond 


this point on a trial-and-error basis (or see Holley (1947) for a criterion for re- 
flection). 
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Factor 
I II Ill IV 
Variable + = 4 4 


14. The second and subsequent tables of residuals are obtained 
by the same procedure as was used to obtain the first one. The 
product of the two loadings is subtracted from the corresponding 
residual correlation in the preceding table with its sign after re- 
flection. The formula for the second residual is 


2Pjk = 1Pjk — 2j24K2 (5.11) 


where 2 stands for the second residual, and the formula for the 
sth residual is 
sPjk = s—1P — 4js@ks (5.12) 


Table 5.7 presents the second residual correlations for the 
example. 


TABLE 5.7. SECOND RESIDUAL-CORRELATION MATRIX 


a, 
AE .063 -.049 -.030 .024 -.008 | .000 
+329 -.049 .025 .017 .002 .005 | .000 
241 .017 .038 -.019 -.006 | .000 
399 024 .002 -.019 -.007 .001 | .001 


-.006 .001 .008 
.000 .001 


.005 
.000 


.379 


15. For small batteries of fewer than 10 variables, the esti- 
mated communalities inserted in the diagonals of the correlation 
matrix should be compared with the communalities obtained from 
the factor loadings. If the discrepancies are large enough so that 
inserting the newly obtained communalities would affect the size 
of the first factor loadings appreciably, the entire analysis may 
may be necessary, using the obtained 


be repeated as many times as t 
ls, until the communality estimates 


communalities in the diagona 
are stabilized. 
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TABLE 5.8. CENTROID FACTOR MATRIX 


Factor 


16. A check on the correctness of the factor loadings can be 
obtained by reproducing the correlations from the factor loadings. 
The discrepancy between a reproduced and original correlation 
should be equal to the corresponding residual correlation in the 
table of residuals computed subsequent to the last centroid factor 
extracted. This check is illustrated in connection with the eleven- 
variable centroid problem (see p. 84). 


Minimum Number of Variables to Determine r Common 
Factors. The minimum number of variables needed to determine 
uniquely r factors is given mathematically by the formula: * 


wit Ver+1 
n= 


2 


TABLE 5.9. MINIMUM NUMBER OF VARIABLES (n) NEEDED 
TO DETERMINE r FACTORS 


ODN OUR WN A] 


10 


Values for n corresponding to various numbers of factors are pre- 
sented in Table 5.9. 


4See Thurstone (1947c, 291-294). 
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By solving the above formula for r we can determine the maxi- 
mum numbers of factors which can be uniquely determined by n 
variables. 


_a+1— V8 +1 


2 


r 


In a factorial study, it is desirable to overdetermine the factors 
by having them represented by more than the minimum number 
of tests. 

Although there are no exact methods for determining the 
number of factors contained in a matrix of empirical correlations, 
several approximate criteria have been suggested and will be pre- 
sented in connection with the eleven-variable illustration in the 
next section. 


EXAMPLE OF AN ANALYSIS OF AN ELEVEN-VARIABLE PROBLEM 
BY THE CENTROID METHOD 

In Table 5.10 are presented the intercorrelations 
of eleven tests which were part of a battery of tests used by the 
U. S. Army Air Forces during World War II. The battery was 
used to classify aviation cadets into training assignments for air- 
crew positions such as pilot, bombardier, and navigator. 


Description of Tests.” (1) Dial and Table Reading. The 
total scores for this test are composed of the scores from two sepa- 
rate parts, Dial Reading and Table Reading, which are contained 
in a single test booklet. Dial Reading measures how quickly and 
accurately the examinee can read the dials on an instrument 
panel. This part contains 57 items; the time limit is 9 minutes. 
Table Reading has two sub-parts. The first consists of a large 
bivariate table in which the examinee is given a pair of marginal 
values and is required to find the entry in the body of the table 
corresponding to these values. In the second sub-part the ex- 
aminee must determine which one of four tables to enter to find 
the pair of numbers called for in the specific item. In Table 


5 For a complete description of these tests including directions, sample items, 
scoring formulas, distributions, reliability and validation statistics, see Guilford 
(1947p). 
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Reading, sub-part I has 43 problems with a time limit of 8 min- 
utes, part II has 43 problems with a time limit of 7 minutes. 


(2) Spatial Orientation I. This is a perceptual-speed test 
using aerial photographs as content. The examinec is required 
to locate small sections of an aerial photograph within a larger 
picture. The test contains 9 large photographs and 49 small cut- 
out photographs to be matched to the large photographs. The 
time limit is 5 minutes. 


(3) Reading Comprehension. This reading-comprehension 
test was designed to measure understanding of paragraph mate- 
rial and ability to make inferences based on material read. An 
attempt was made to minimize mechanical and numerical content 
in the material presented. Material was included on navigation, 
physics, map reading, and astronomy. There are eight such para- 
graphs with 36 questions to be answered. The time allowed is 
30 minutes. 


(4) Instrument Comprehension. The original form of this 
test was the first of the tests used in the aviation psychology p¥° 
gram which require the determination of the location of a plane 
from views of instruments or terrain as seen from the cockpit. 
Because of its validity, this area was explored intensively, and 
other factorially similar tests were constructed. In this test each 
item consists of pictures of two instruments, an artificial horizon 
and a compass, followed by pictures of a plane in five attitudes. 
The problem is to determine which of the five planes has a posi- 
tion and direction consistent with the instrument readings- 
There are 60 items in the test; the time allowed is 15 minutes. 


(5) Mechanical Principles. The idea for this test was derived 
from existing tests such as Thurstone’s Mechanical Movements 
test and Bennett and Fry’s Mechanical Comprehension test. Each 
item consists of a pictured mechanical situation, with questions 
about it designed to test the examinee’s ability to understand 
mechanical forces and movements. There are 40 items; the time 
allowed is 20 minutes. 


(6) Speed of Identification. This is a perceptual-speed test 
similar to Thurstone’s Identical Forms test; it requires the match- 
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ing of airplane silhouettes. The examinee must choose one of 
five silhouettes at the right which corresponds to the silhouette 
on the left. The test consists of 12 sets of planes with a total of 
48 items. The time limit is 4 minutes. 


(7 and 8) Numerical Operations I and Numerical Operations 
II. This is a simple numerical-computation test, scored in two 
parts for this analysis. The 100 problems printed on the front 
of the I.B.M. sheet (test 7) involve addition and multiplication. 
The 80 problems on the back (test 8) involve subtraction and 
division. The time limit for each test is 5 minutes. 


(9) Mechanical Information. This is a verbally stated mechan- 
ical-knowledge test, relating particularly to the operation of parts 
of automobiles. Most of the items are quite brief, calling for 
only a limited amount of reading and requiring quite specific 
mechanical knowledge. There are 30 items; the time allowed is 


12 minutes. 


(10) Practical Judgment. This test requires the solving of 
verbally presented problem situations. A number of bits of fac- 
tual information are presented; the examinee is required to de- 
termine the most practical course of action among the five pro- 
posed alternatives. There are 30 items; the time allowed is 30 


minutes. 


(11) Complex Coordination. This is an apparatus test of the 
speed and accuracy of hand and foot adjustment to a complex 
perceptual stimulus. The examinee operates controls similar to 
those used in an airplane in flight. The subject faces a stimulus 
panel on which there are three rows of red lights and three cor- 
responding rows of green lights. A particular pattern of red 
lights, one light in each row, is presented to the subject. His task 
is to move his controls so as to turn on the green light correspond- 
ing to each of the red lights. As soon as each of the three red 
lights is matched by the corresponding green light, a new set of 
red lights is automatically presented. The total test time limit 


is 8 minutes. 


First Centroid Factor Loadings. The centroid factor loadings 
of these tests are obtained from their intercorrelations as follows: 
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1. The highest correlation coefficient in each column is placed 
in the diagonal cell of that column as an estimate of the com- 
munality. (See Table 5.10.) 


TABLE 5.10. CORRELATION MATRIX? 


Test 1 E 3 8 5 6 7y OF 0n L 


y1 
1. Dial ond Toble Reoding.....f.55) 40 .41 .37 .20 .32 .52 .55 .02 .27 .36 |3.42 
2. Spotiol Orientation 1...+.+. 


+40 1.46) 18 .28 .15 .46 .23 .23 .01 .14 .30 | 2.38 
3. Reading Comprehension ss.. |. 41 .18 (.41) .33 .32 .18 .29 .35 18 .41 .2012.85 
4. Instrument Comprehension.. |.37 .28 .33 (.38) .38 .28 .17 .22 .22 .26 .36 | 2.87 
5. Mechanical Principles .....].20 .15 .32 .38 t.47) 16 -.02 .08 .47 .33 .31 | 238 
6. Speed of Identification .... |.32 .46 .18 .28 16 (.46) 45 .18 07 .14 .28 | 2.22 
7. Numerical Operations I....].52 .23 .29 17 -.02 .15 (.67) .67-.13 .13 .16 | 2.17 
8. Numerical OperctionsH...].55 .23 .35 .22 08 .18 .67(.67)-.06 .20 .12 | 2.54 
9. Mechanicol Information...|.02 .01 .18 .22 .47 .07 -.13 -.06 (.47) .25 .18 | 1.21 
10. Practical Judgment .......]-27 .14 41 .26 .33 14 .13 .20 .25 (.41) .20 | 2.33 
11. Complex Coordination ..... +36 .30 .20 .36 .31 .28 .16 .12 18 .20(.36)|2.47 

Ej [342 238 285 287 2.38 2.22 2.17 2.54 1.21 233 2.47 |26.84| 6=Totol Negs- 
(15,))| 4% =5.670097 
#1 [3.97 2.84 3.26 325 285 2.68 284 3.21 1.68 2.74 283 32.15 = 7, jp =.176364 
aj, |700 .501 .575 .573 .503 .473 .501 .566 .296 .483 .499 |5.670 = Łojı 


No.of Negotives 


con=no0-=0000 


sql [3.42 238 285 287 242 222 2.47 2.66 1.59 233 2.47 27.68 = EE 


45.31 =Eh 
32.99 Reest. 
4 13.084 
1 * 32.99 

o 
$, =.397 
6 


Í 1 These are product-moment correlation coefficients computed fom normalized 
distributions. They were obtained from Guilford (1947b, Appendix ©), and they 
are based on a sample of 8,158 unclassified aviation students. 


2. The sum of the correlation coefficients in each column 
(omitting the value in the diagonal cell) is entered below the 
column in the row labeled “Sy.” As a check each row is added 
across and the sum placed in the column to the right, also labeled 
“ya” i 

3. Since all column sums are positive, it is not necessary to 
reflect any of the variables at this point in the procedure. 

4. To the total of the correlation coefficients in each column 
is added the estimated communality and the sum placed in the 
next row labeled “ta.” 

5. The column totals are summed across and the result is 
called “Tı.” (Ty = Sty.) This value is the sum of all the entries 
in the correlation matrix, including the diagonal values. 


ia 
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6. Calculate VT; and 1/ VT. 

7. Obtain the loadings of the tests on the first centroid factor 
by multiplying each variable in row tj, in turn, by 1/V Tx, (an = 
ta:ļ/V T). If it is intended to calculate factor loadings to two 
decimal places, the calculations should be carried out to at least 
the third decimal place and preferably the fourth to minimize 
rounding errors. È 

8. Although, for convenience, the loadings have been obtained 
in a row, they are the first column of the matrix of centroid factor 
loadings and are entered as such to two decimal places in the first 
column of Table 5.18. Since there were no reflections of vari- 
ables, the loadings on factor I are all positive. 


Second Centroid Factor Loadings. The calculation of the 
second centroid factor loadings will be described under the follow- 
ing headings: residual correlation matrix, reflection, and calcula- 
tion of loadings. f 


A. Residual Correlation Matrix. 1. The loadings on centroid 
factor Ihave accounted for a portion of each correlation. Now 
we wish subtract the portion of each correlation which has 
been accounted for by the first centroid factor and determine the 
residual correlations. It will be recalled that, for orthogonal 
factors, Tje = ajri + jlr +*+ Ajrâkr. The first factor ac- 
counts for (aj1:4,1) and the first residual correlation (1p;,) is equal 
tO rp, = Tje — Qrer = Ajodna +++ Grr Therefore, to deter- 
mine the first residual correlation between variables j and k, the 
product of the loadings of the two variables on factor I is sub- 
tracted from the correlation between them. ‘This is done sys- 
tematically in Table 5.11 where all of the first residual-correlation 
coefficients are shown including the residual communalities in 
parentheses. 

2. A partial check on the accuracy of the computation of the 
residuals is that the sum of each column (or row), including diag- 
onal values, should be zero (or within a rounding-error margin). 
The sums are placed in a row (or column) labeled “Xo.” 


6 The reason for expecting this zero sum is that the first centroid-factor vector 
is located so that the sum of the residual correlations for each variable is zero. 
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TABLE 5.11. FIRST RESIDUAL-CORRELATION MATRIX * 


è E 2 sA) 
aj 700 501 575 5% 503 473 50 566 296 483 499 
Test 1 2 3 4 5 6 7 8 9 10 “n Eo [No.of Negatives 
No. = 
a ee == 
+| 700 | 1 (60) 049 008 7031 “152-011 169 154 “187 t068 011 | 002| 3 
223 + + + + + Ši 
a| 501 | 2 |049 12092108 Ż007 *102 223 -021 -054 “138 “102 “050 -001 | 3 
a a gaa 
575 | 3 |008-108 (079) 00! 031-092 002 025 010 132 -0987 | 001 4 
4 ; 17 i 
573 | 4 [Żo31 +007 001 (052) 092 0097117 *104 050 -017 074| 002| 2 
321 
+ Cc =. + + + 
503 | 5 [152 =102, 031 092. (217)-078 -272-205 321 087 059 |-o02 | o 
; 2 223 - i 
*| 473 6 [ON . 223 tose 009!*078 (236)-087 -088 *o70 tose 044 |-002 5. 
- + 386 
al 501 | 7 | 169-021-002 *117 *272 -087 (419) 386 “278 ~112~090 Loo | 3 
= + 386 4 4 + 
*| 566 8 154 -054 *025 -104 -205 -088 386 (350)-228 -073 -162 | 001 3 
+ + + + + 321 
296 | 9 |t1e7'*138 oio 050 321 “070 *278 228 (3682) 107 032 | 001] o 
132 
483 | 10 toes *102 132 -017 087 oes *112 Ż073 107~ (1771-041 | oo2| 2 
Met 7 162 ; 
499 o1- 050-087 074 059 044 $090 162 _032 -041 (iM | 001| ‘5 
Zo|002 -001 001 002 -002 -002 -001 001 001 002 001 |EEa| 30=Totol Negs. 
j2l-058 -210 -078 -050 -219 -238 -420 -349 -381 -175 -110=|-2.288 
Column 7 [396 -168 -082 184 325-064 420-1121 175 049 070 |-608| 
8 þ-704-060 -132 392 735 12 1192 m21 631 195 394 |3.876 
1 |204 -158%-148 454 1039 134 1.530 1.429 1.005 331 372 |6.692 
2 |802 158 068 468 1.243 -312 1.488 1.321 1.281 535 272 |7.324 
6 |780 604 252 450 1.399 312 1.314 1.145 1.421 711 184 8.572|_ 58 
4,|967 827 384 567 1.720 535 1.700 1.531 1.742 843 346 h1.162=% SER 
a| 289 248 115 170 515 160 509 458 521 252 104 [|3.341=Io, a 299 


- \¥al] 840 854 496 502 -1.399 790 1.534 1.479 1421 827 650 |10.792=ZIZ,al 10792=1lzl Paes 
42.292=Ih g +2590 hRet — 5 
Resid: {3302 $25. 

7 


13.084 


1In this and in the succeeding residual-correlation matrices, all entries without 
decimal points are to three decimal places, 


3. Next the sum of each column of the residual-correlation 
matrix, without the value in the diagonal cell, is obtained and 
placed in a column labeled “3;2.” These column totals are usually 
negative. The sum of these values (23;2) is placed to the right 
of the row and represents the total of the residual correlations 7 
the table not including the residual communalities. If any OY all 
of the column sums referred to are negative, a process of reflection 
of variables is begun to make the value of (3342) as high positive 
as possible. 


B. Reflection of Variables. 1. The order in which variables 
are reflected is arbitrary, the aim being merely to make the sum 
of the residual coefficients as high positive as possible in order tO 
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maximize the amount of variance which will be extracted in the 
next centroid factor. Although other criteria (e.g., column with 
largest number of negatives, highest communality, or highest ab- 
solute sum) for selecting the columns to be reflected could be 
used, we shall follow the principle of reflecting the column with 
the highest negative sum. In the first table of residuals, column 
7 has the largest negative sum in row 3j2 and variable 7 is re- 
flected first. 

2. Row 7 and column 7 are starred to indicate that they have 
been reflected. In the next row (labeled column 7 in Table 5.11), 
the sums of the columns, with the reflected values of column and 
row 7, are entered according to the procedure outlined in steps 
4 through 9 in the previous section (see p. 65). 

3. After these adjusted column totals have been obtained, they 
are summed and the total placed in the 33; column to the right 
of the row. A check on the correctness of the computations may 
be obtained by adding four times the value of the adjusted sum 
of the reflected column to the preceding Sj: total. (In the ex- 
ample, —2.288 + 4(.420) = —.608, which is also the row total.) 
The total of the reflected column is changed from its minus value 
to the corresponding plus value and the same is true of the cor- 
responding row total, hence the increase in the table total by 
four times the value of the reflected column sum. 

4. The variable with the highest negative, adjusted column 
total (in the example column 8) is reflected next and so on until 


all the column totals are positive or zero. 


C. Calculation of Loadings. 1. The re-estimated communality 
for each variable (equal to its highest residual correlation, but 
positive in sign), which has been placed above the residual com- 
munality, is added to the last adjusted column sum, and the total 
placed in row ty. 

2. The factor loadings are then obtained by steps 11 and 12 
of the preceding section (p. 66). 

3. The signs of the reflected variables in the residual correla- 
tion matrix are changed according to the rules in step 10 (p. 66). 

4. The second centroid factor loadings are entered in the sec- 
ond column of Table 5.19 with proper sign. Since none of the 


\e 
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TABLE 5.12. SECOND RESIDUAL-CORRELATION MATRIX 


= x * 


A 289 248 115 170 515 160 509 458 521 252 104 | 


Test 4 6 7 10 11 | Eo [No.of Negotives 
No. 1 2 3 5 8 9 0 g 


one oe tosi -018 003 *057 022 022 036 *005 -041 | 001 2 


+023 a 079 +035 t026 183 “147 *168 009 040 + t076 [003 1 


=041 079 aa pis *oza 074 *061 *078 *050 103 toa9 -001 Q 
170| 4 -o18 +035 toig oa oo to36 030 026 -039 $060 056 |-003 2 
515 | 5 | 003~026 2028 004 (oat 004 010-031 053 +043 005 |-001 1 
x| 160 | 6 [057 183 074 +036 t004 dgnt Ei *161 =013 0487061 | 002 0 
509| 7 | 022 “147 7061 030 010 +168 eh 153 013 *016 037 | 000 o 
458 | 8 | o22 +168 +078 026 -031 tiei 153 di p toa2 114 | 000 2 


Koos oso 103*060*043 048 *016 $042 ~o2a (ooto? oo2 | o 
+ 4 
104 | 11-041 +076 *099 056 005061 037 114 -022 ~067 (151) 


003 2 
Eo| 001 -003 -001 -003 -001 002 000 000 002 002 an | Ts) 14 =Total Negs. 
-102 -164 -120 -0% -057 -195 -127 -176 -048 -066 -154 |-1.300 
Column 6 | 012 -530 -268 -019 -049 195 209 146 -022 -162 -032 |-520 
058 530-426 051 003 561 503 482 -040-242 120 | 1.600) 
140 688 426 089 059 709 625 638 060-448 318 |3.304 
150 768 632 209 145 805 657 722 108 448 452 bes 


521 | 9 bo “009 +050 -039 053 žog 03 -011 002a -022 | 002 4 


4 
735 269 198 988 825 890 161 551 566 |6.341=73 apean 
a3 | 082 378 292 107 079 392 328 353 064 219 225 |2.519 =1a;3 w ae 


= 3.32 

323 207 806 aap 578 |5780 =1lza] 5780=11%l a Fos 
+1296 Be 41245 Eh? pr 

7076 Resid 7.025 Reest 65. 473 


3 


variables were reflected in the extraction of the first centroid 
factor, factor loadings for all variables reflected in the table of 
first residuals have negative signs. A new table of residuals 
is made up by subtracting the products of the factor loadings 


just obtained from the preceding residual matrix with changed 
signs for reflected variables. 


Third and Subsequent Centroid Factors. The routine for 
extracting the third and each subsequent centroid factor is a 
repetition of that used to extract the second centroid factoT- 
The major steps are as follows: first, obtain the residual co- 
relation matrix as in Section A, above; second, reflect some of 
the variables so as to maximize the positive sum of the ann 
of residual correlations (excluding the values in the principal 


Diagonal and Centroid Methods of Factoring viel 


TABLE 5.13. THIRD RESIDUAL-CORRELATION MATRIX 


* * s+ + . 
a3 082 378 292 107 079 3% 328 353 064 219 225 
Rese 1 2 3 4 5 6 7 B 9 10 11 | Eq |No. of Negotives 


059 = 

(0501 $008 o17 to27 -003 7025 $005 Ż007 031 -013 +059 | oo1 | 3 
4: 

toos (a40t031 -005 to04 035 023 035 +033 $043 -009 | 0o00 | 2 

039 033 | oo0 | 1 


040 
017 tos: (018) Żoi2 005 7040 *035 $025 03 
6 


04 à 

027 -005 tor (0491 $004 -006 -005 -012 $046 “037 032 | ooi | 5 

p Ste ig ‘ 

003 tooa 005 4004 (047) toz7 to16 *os9 048 026 *013 | 000| 1 

be ceed 040 

+1392 | 6 |7025" 035 t040 -0065 S027 (029) 09 os toe tose -027 | 001 | 3 
056 

1005 023 £035 -005 £016 039 (060) 037 +008 Ž056 -037 |-003 | 2 

059 

007 035 *025 o12 $059 023 037 0231 *034 žo35 035 | oor | 4 
Ei 


031 fox 031 to46 049 *o12 Ž008 *034 (049) 010 *036 | 000 


+) 353 


o 

+ - ae ee 056 _ 
013 toa3 039 "037 026 *038 =056 =035 010 (055) 018 | 000 | 3 
5 


+ ” + ne 
059 -009 ~033 032 t013 -027 -037 035 ~036 018 (063)| 000 
001000 000 00! 000 001-003 001 000 000 000| EE,,*| 26-Total Negs. 
-049 -040 -018 -048 -047 -028 -063 -042 -049 -055 -063 |-502 
069 -022 -094 -112 -021 026 011 ~112 -023 -091 063 |-250 
083 -092 -034 -098 097 -020-063 112 091 -021 133 | 198 
099 092 028 -078 105 -090-109 182 157 065 15| 566 
409 138 098-068 137 -168 109 256 173 177 041 |1002 
059 208 178-056 191 168 187 302 197 253-013 |1674 
113 198 202 056 199 156 177 278 289 179 051 |1898 
m72 241242 102 258 196 233 337 337 235 10 |2463 =74 JT, =1.569395 
MO 154 154 065 164 125 148 215 215 150 070 |1.570 =Eo4 Jer 837108 


*| 225 


lza] 195 226 268 186 205 272 261 302 289 315 299 |288 =Elgal 2.818 =25,41 gare 
3 


diagonal), as in Section B above; and third, calculate the cen- 
troid loadings, following the steps in Section C. The computa- 
tions are illustrated in Tables 5.12 to 5.17. The loadings for each 
factor are entered in the corresponding column of Table 5.18, and 
the communality (h?) for each variable is computed. ‘i 


Criteria for Sufficient Factors. There are no exact criteria 
for stopping extraction of factors. A number of empirical cri- 
teria, however, have been developed, several of which will be 


described." 
A. Tucker's Phi. This criterion uses the principle that if 
there is no significant decrease in the size of the residual values 


7 Twenty-five criteria are listed by Vernon et al. (1949). See also Burt (1952). 
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TABLE 5.14. FOURTH RESIDUAL-CORRELATION MATRIX 


> * . * . 
Oja nO 154 154 065 164 125 148 215 215 150 070 
Test 1 2 3 4 5 6 7 8 9 10 u 


ÁÁ 
Zo |No.of Negatives 


110} 1 |<G27%009 000 020 -021 +039 ton 017 007 2030 051 
009 (019) 007 tors *o21 016 000 ooz 000 0204020 o 
154] 3 |000 007 (0617002 $020 021° or -008*002 0o16 Żosa|oo0| 2 
sloes| 4 | 0204015 “ove (629)-007 tora tois 4026 032 +047 027 |-001 | 2 
eliga] 5 oat toz1 4020 -007 103217007 4008 ~oza 013 “001 002 | ooo | 5 
039 3 

o 


125| 6 039 016 021 *o14 “007 (024) gat -0047015 019 -036 | 000 


Foi 000 o12 +015 +008 ozi (034) 995 toza 034 £047 | 001 


215 O17 002 -008 *026 “024 -004 005 (0132012 003 ~020 | ooo 


5 
6 
148| 7 
8 
9 |007 000*002 o3 013 7015 +024 *o12 1008/4022 o21 | 000 
150| 10 F030 020 o16*047 “oor 019 034 0034022 (eaitoz9 -001 
«|070 


e osi 
*020*044 027 002 *o36 +047 7020 021 $029 (o= l-001 
-001 000-001 002 000 001 000 000-001 -001 | rE- 
-020 -016 -043 -030 -024 -033 -013 -002 -035 -055 


18=Total Negs. 


Column 11 }-151 020 072 -097 -034 048 061 -053-044 023 
038 072 -137 008 126 083 -019-058 083 
068 068 137 O22 154 113 033-122 177 
068 072 201-004 184 161 057 122 221 
110 112 187 004 170 177 009 148 219 


131 156 234 028 209 224 035 180 266 308 [1.985 =7, ae 
093 111 166 020 148 159 025 128 189 219 [1.410 Los Ms -709 


MO 132 205 124 192 177 121 148 22 297 h.932 =rlțs| 1,932 =El£ysl ay eet 


2 
+.317 =Ehp 
Zza Reid 5 


2 
+.429 “Eh 
aget Reest #7537 


4 


from one matrix to the next, the significant common-factor 
variance has been extracted. The steps are as follows: 


1. Add all of the values for a residual matrix (s), disregarding 
signs, and using the re-estimated values of the communality 1 
the diagonal cells. 

_ 2. Add all of the values for the next residual matrix (s+ 1), 
disregarding signs and using the residual communalities in the 
diagonal cells. 

3. To obtain phi, divide the value obtained in step 2 by the 
value obtained in step 1 above. The formula for phi is: 


des = Z| ps4 | L Z| Zio | + ZA resia. 
A SI pa | Z| Dis | + Zh?re-est. 
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TABLE 5.15. FIFTH RESIDUAL-CORRELATION MATRIX 


EE es + A 
4s 152 093 m 166 020 148 159 025 128 189 219) 
a e sase? S o w n gs emae 


152] 1 
+Jo93| 2 foos (ora) -003 000 019 002 -015 000012 “003 000 | 001 | 3 
«lin | 3 [£017 -003 @3aæżo20 ois 004-006 ton Žoi2 Ž005 020] 000] 2 


O24 + - 
0281+005 +017 -005 -024 017 *o13 013 -012 001 “018 | 001 4 


022 
-005 000020 corn)" 019 ton ton ozz on o16*o09| 002] 1 
25 


= = O12 

-012 tor tore on ~010*004 7004 009 (161-002 +007 001 
189| 10 | 001 “003005 016 +005 +009 “004 -002 -002 Oton 000 
2 ‘018 000 020 4009-003 004 o12 +025 Ż007 *013 1003) 
001 000 002 000 000 000-001 001 000 000 | IE e| 
-011 -032 -017 -024 -017 -022 -026 -015 -012 -003 
Column 3 | 007 -005 032 023-060-025 -010 -004 009-002 -043 

5 | 055-043 068 043 060-007-020 046-011 008 -037 
065 043 062 043 098-011 O10 046 013 002-037 
029 043 102 061 092 -019 -014 096 027 028 037 
6 oos 047 110 083 074 019-008 112 035 046 045 
7 | 021 017 098 105 084 013 008 110 027 038 069 
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A= Total Negs. 


45|045 036 118 127 109 030 023 135 039 054 094 J7= 900000 

ayg| 050 040 131 141 121 033 026 150 043 060 104 ganun 
6 

lx, 059 n6 115 071 074 116 083 1.058 =zl5el 1.058=zlE el A= 914 

4.210 =EA*Resid. t220=£A Reest. $ 1278 

re 1.278 es 15 


4. Compute (n — 1)/(n + 1) where n is the number of test 
variables in the matrix. 

5. If ¢ exceeds (n — 1)/(n + 1) in value, there are s significant 
factors. The adequacy of this criterion depends, in part, on the 
adequacy of the reflections. If the reflections for a matrix have 
not resulted in the highest positive sum, phi may exceed the 
value (n — 1)/(n + 1) and then drop below it again on a succeed- 
ing factor. This criterion also involves considerable additional 
computation, since to show that there are s significant factors re- 
quires s + 1 computed factors and s + 1 tables of residuals. (See 


Thurstone 1938 and Blakey 1940.) 


B. Humphrey's Rule. This criterion takes into account N, 
the size of the sample, and is dependent on the loadings of only 
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TABLE 5.16. SIXTH RESIDUAL-CORRELATION MATRIX 


pA 050 040 131 141 121 033 026 150 043 060 104 
Test > i ge 6 8 9 10 1 | Zo [No.ofNegotives 
No.m g n | Fo 


923 aay + M a 
(022) 003 010 -012 018 -019 012 005 -014 -002 -023 | 000 1 
016 


003 10ih žo08 2006 014 “001 -016 -006 “0104005 to04| ooo | 4 
7o10 too 1009) 002 “002 000 4009 *o09 006 -003 006 | ooo | 3 
12 t006 002 1002Ž007 008 “007 “001 005 008-006] ooo | 3 
ore ora “ooz 4007 i0i0ižor3 002 007 fots to02 to16 | ooo | 4 
o33| 6 Hoi “oor 000 006 to13 iore)4004 “003 003 007 001 | 001 | 2 
012 -016 $009 7007 002 +008 (518) -005 *005 *006 “009 |-o01 | 4 
»|150| @ | 005 -006 $009 “oo! 007 ~003 -005 TA o11 7009 |-001 
Toro 006 005 *o15 003Ž005 “003 (018i -005 003 | 001 
060| 10 ooz toos -003 oos oo2 007 +006 tom -00s (012) 007 | 000 
023 *004 006-006 *o16 001 "009 “009 003 007 (012) | 000 


000 000 000 000 000 001 -001 -001 001 000 000 IL 9=| 32=Total Negs. 
ZT ;7/022 -017 -003 -002 -010 -015 -015 -003 -009 -012 014 |-122 


Column 1 | 022 -023 -023 022-046 023-039 -013 019 -008 032 |-034 

2 |058 -051 -027 036 046 049 -043 -027 049-004 064 | 150 
O51 -011 048 074 047-011 -015 029 006 072| 354 
039 007 046 088 041 -001 015 023 028 054 | 414 
007 025 032 092 049 001 005 033 040 036 | 418 


15 
023 035 044 110 068 017 016 048 051 059 |.592 =77 Jae 7694 


A= 1.299688 
030 045 057 143 088 022 021 062 066 077 |.768 =o, Wy 


wnwa 


-121 
073 055 060 096 057 075 059 069 056 074 |.792 =ElĘ;}l .792=EIE,71 4 ad 


2 2 t 
4.122 =E} Resid. +174 =Eh Reest. 
"914 0.966 42.753 


two variables (which should be sufficient to establish a factor) 
rather than on the entire matrix. 


1. Find the product of the two highest loadings in a column 
of the centroid factor matrix. 


2. Find the standard error of a correlation coefficient of zero» 
for the type of correlation and size of sample being used (€-8+ 
1/YN for the Pearson product-moment 7). : 

3. If the product found in step 1 above does not exceed twice 
the standard error found in step 2, the factor is probably not 


significant. This rule works best for samples that are not too 
large. 


C. Coombs’ Criterion. This criterion is applicable only to 
analyses in which the table of correlations contains only positive 
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TABLE 5.17. SEVENTH RESIDUAL-CORRELATION MATRIX 


a, 157 030 043 057 143088 022 O21 062 066 O77 | 
poa 4 4 5 & F B 9 10 
457 | 1 [-002) -002 -017 003-005 005 009 002 004 -008 01 
030] 2 |-002 (015) 007 004 010 -004 -017 -007 -012 003 002 
045} 3 |-017 007 (008)-001 -009 -004 008 008 003 -006 003 | 000 
057] 4 | 003 004-001 (009) 001 001 -008 -002 001 004 -010 | 000 
143} 5 |-005 010-009 -001 (-003) 000 -001 004 006 -008 005 |-002 
osa | 6 | 005 -004 -004 001 000 (014) 002 -005 -003 001-006 |-002 
022 | 7 | 009 -017 008 -008 -001 002 (016)-006 004 005 -011 | 001 
021| 8 | 002 -007 008 -002 004 -005 -006 (011)-004 010 -011 | 000 
062 | 9 | 004 -012 003 001 006 -003 004 -004 (011) -009 -002 |-001 
066 | 10 |-o08 003-006 004-008 001 005 010 -009 (007) 002 | 001 
077/44 | 011 002 003 -010 005 -006 -011 -011 -002 002 1017) 
Zo | 000-001 000 000-002 -002 001 000-001 001 000 |z 
066 075 066 034 048 031 071 059 048 056 063 | .617 


4.410 = EA? Resid. 
Tet 


or zero values. Small negatives, not significantly different from 
zero, are permissible. It depends on the patterning of residuals 
rather than on their size. It is assumed that, if there are significant 
factors, rather than merely error variance, remaining in the table 
of residuals, there should be no more negative values in the 
residual matrix, after reflection, than could be expected by chance 
from a residual table derived from a correlation matrix with posi- 
tive manifold. 

Count the number of negatives remaining in a table of resid- 
uals after reflection. If that number is not significantly different 
from the value of “C” obtained from Table 5.19, for the number 


of variables, sufficient factors have been extracted (Coombs, 


1941b). 

None of the criteria mentioned above are exact and, since they 
are based on different principles, they may give contrary indica- 
tions. It is probably best, in practice, to get indications from 
several of them and to use that information to help make a judg- 
ment of the number of factors that will lead to meaningful results. 
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TABLE 5.18. CENTROID FACTOR MATRIX (F;) 


No. of 
Reflections 
1. Dial and Table Reading.... 2 
2. Spatial Orientation I.s... 3 
3. Reading Comprehension... 1 
4. Instrument Comprehension . 2 
5. Mechanical Principles s... 1 
6. Speed of Identification ... 3 
7. Numerical OperationsI... 2 
8. Numerical Operations I... 2 
9. Mechanical Information. .. 1 
10. Practical Judgment...... : 
11. Complex Coordination. ... | (oes: See 


Sum of Cross- 


Correlation 


Products + Resid.| from Table 10 
with Adjusted Sign 


Application of the Criteria for Sufficient Factors. In orde 
that the various criteria described above for determining whe" 
to stop extraction of factors may be applied, it is necessary tO ob- 
tain the following sums for each correlation matrix: 


l. Tucker’s phi requires the absolute (non-algebraic) sum of 
all the entries in the matrix, excluding the diagonal cells. This 35 
obtained first by columns and entered in row | Six |- (See Tables 
5.10 through 5.17.) The sum of these values, 3| Sn |> is added 
first to the sum of the residual communalities and later the sum 
of the new (re-estimated) communalities. Tucker’s phi is then 
calculated by the procedure described on pages 77-79. 
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TABLE 5.19. CRITICAL VALUES OF COOMBS’ CRITERION 
AND THEIR STANDARD ERRORS FOR BATTERIES OF FROM 
TEN TO FIFTY VARIABLES? 


A 


38 421 592 19 
39 42.2 6235 19 
40 42.3 660 20 
41 424 695 20 
i S “732 fa 
43 42.6 769 21 
44 427 808 22 
45 428 847 22 
46 429 888 23 
47 43.0 930 23 
48 431 972 24 
35 41.8 497 17 |” 43.2 1016 24 


oowoo sy oaoa 


20 39.3 149 
21 39.5 166 
22 39.8 184 
23 40.0 202 


1 From Coombs (1941b) 


n = Number of variables. 4 . X F 
% = Percentage of negative entries in the residual matrix after sign changé. 
C = Number of negative entries in the residual matrix after sign change. 


s, = Standard error of C. 


3 


6 


36 4.9 528 18 50 43.3 1061 25 
42.0 559 18 


2. The number of negative entries in each table of residuals, 
after the signs of reflected variables have been changed, should 
be obtained if it is desired to apply Coombs’ criterion. The num- 
ber of negatives in each row of the residual matrix is shown at 
the right, beginning with the first correlation matrix in the exam- 
ple. The number of negative entries in the entire table is shown 
in the sum at the bottom of the column. The application of this 
criterion is explained on pages 80-81. 


Table 5.20 summarizes the data which have been obtained to 
help decide whether all of the common-factor variance has been 
extracted from the correlational matrix. Tucker’s phi never does 
reach its criterion value in the example but does show a large 
increase for the sixth factor and maintains it for the seventh 
factor. This leveling off is often more indicative than the size 
of phi. 

Although the number of negative signs comes within two 
standard errors of C for an eleven-variable matrix, at the second 
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TABLE 5.20. DATA FOR APPLYING THE CRITERIA FOR 
SUFFICIENT COMMON FACTORS TO THE ELEVEN-VARI- 
ABLE MATRIX CRITERION 


Coombs’ 
Criterion 


Tucker's 


Factor Phi 


Criterion 
Value 


factor, it drops again and does not reach the critical value again 
until the sixth factor. All three of these approximate criteria 
seem to signal that the sixth and seventh factors are not significant 
and, hence only five factors should be retained for rotation and 
interpretation. 

Since the criteria are not exact, it is frequently necessary, 17 
practice, to extract more than the minimum number of factors 
indicated by the criteria and determine by rotation how many 
factors are needed to give a meaningful solution. If the rotations 
are started with too many factors, some of them may be “resid- 


ualized” (i.e., achieve small loadings only, usually between +0.20) 
during rotation.’ 


Checking the Centroid Loadings. The centroid loadings 
rounded to two decimal places, are recorded, with proper sign, 
in Table 5.18. A partial check on the accuracy of the centro} 
loadings may be had at this point, by reproducing the original 
correlations, following the steps below. If reproducing the entire 
matrix is considered too laborious, only the correlations in the 


> i „ni A 
8 A few of the other approximate criteria which have been used for significan¢ 
of factors are: 


1. Obtain the absolute sum of the residuals, excluding the diagonal entries. 
This value should level off abruptly after the significant factors have been €% 
tracted. (Mosier, 1939b.) 

2. Make a frequency distribution of the coefficients in a table of residuals. 
Compare the standard deviation of the distribution with the standard error ° 
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diagonal above the principal diagonal need be reproduced (celly, », 
cells,z, etc.) and the cell in the upper right-hand corner of the 
correlation matrix (celli,1:) since that will use the loadings of 
each test twice. 


1, Obtain the cross-products of the loadings of each variable 
with its neighboring variable [e.g., from Table 5.18, for variables 
1 X 2: (.70)(.50) + (—.29)(—.25) + (—.08)(.38) + (—.11)(—.15) + 
(.15)(—.09) = 0.3951]. 

2. Determine the number of times each variable was reflected. 
(This can be done by counting the number of changes in the signs 
of the centroid loadings of a test. Thus, in Table 5.18, test 1 
changes from positive to negative on factor II and continues nega- 
tive until it changes back to positive on factor V, indicating 2 
reflections for test 1 throughout the 5 factors.) If the combined 
sum of the number of reflections for the two variables whose cor- 
relation is being reproduced is an even number, the residual after 
the last factor (fifth residual matrix, with signs prior to reflection, 
see Table 5.15 in the example) is added algebraically to the sum 
of the cross-products. If the sum of the number of reflections of 
the two variables is an odd number, the sign of the residual is 
changed before adding it to the sum of the cross-products. 

3. The resulting sum (rounded) should equal the correlation 
between the two variables. This work is carried out systematically 
for the example in the lower part of Table 5.18. 


PROBLEMS 


1. Factor the correlation matrix on page 17 by the diagonal method, 
using the following exact values for the communalities: 


Av = 58 hs? = .25 
hg? = 81 he? = .36 
hg? = .52 h? = .73 
hy? = 45 hg” = .64 


the mean r. An alternative is to compare the median residual, disregarding sign, 
with the probable error of the median r. (Thurstone, 1938.) J 
3. Obtain the standard deviation of the residuals for the sth factor. Obtain 
the average of communalities for all tests for the s factors extracted (h,2). Compute 
€, = os/(1 — hs2). The sth factor is significant if ø, exceeds the standard error of a 


zero r. (McNemar, 1942a.) 
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2. Reproduce the correlation matrix including the communalities 
from the factor loadings. (FF’ = R.) . 

3. Plot the factor loadings and rotate the axes graphically to posi- 
tive manifold. Obtain the rotated loadings from the plot. Check 
the h?’s of the rotated loadings against the h?’s of the diagonal loadings. 

4. Rotate analytically the same number of degrees as was 3 
graphically in Problem 3. Check the h?’s of the rotated loadings “aa 
compare the rotated loadings obtained graphically with those obtair 

lytically. . 
a ae the intercorrelation matrix on page 17 by the centroid 
method using the highest r in each column as an estimate of the h?, 
and re-estimate the communality in each table of residual onan 
Repeat the entire process as many times as may be necessary to stabi 
ize the communality estimates and apply the criteria for sufficien 
factors. Check the results by “reproducing” the correlation cocan 
from the centroid loadings and then adding the coefficients in the fina 
table of residuals as was illustrated at the bottom of Table 5.18. 

6. Extract the factors from the correlation matrix below by the 
centroid method and apply the criteria for sufficient factors. 


S 
INTERCORRELATIONS OF Scores oF One Hunprep COLLEGE SOPHOMORE: 


0 
Test 1 2 3 4 8 € 7. BB 5 
1. ACE! Arithmetic 31 52 18 .21 42 10.08 E -A 
2. ACE Figure Analogies Al 59 31 27 .58 11 —10 . > = 
3. ACE Number Series 52-59 QT OF 38: .32 01 Ta ‘od 
4. ACE Verbal Completion .18 31 21 71 61 .19 —.04 Sa oe 
5. ACE Same-Opposite A 27 OF oi 68 39 19. 3 08 
6. ACE Verbal Analogies 42 .58 .33 .61 .68 Ot 12, «3 
7. Educational Psychology— 5 AT 
First Midterm 10 Al A AS «89. 221 62 6 
8. Educational Psychology— n 
Second Midterm 03 —.10 01 —.04 .19 .12 .62 50 
9. Educational Psychology— 60 
Third Midterm 24 14 13 18 44 33 63 «50 
10. Educational Psychology— 0 
Final Examination 07 —.16 —.05 04 .24 .08 .47 72.6 


. a Data 
1 American Council on Education—Psychological Examination, 1941 Edition. ( 
furnished by J. V. West.) 


CHAPTER © 


Multiple-Group and 
Principal-Axes Methods 


of Factoring 


Tur MULTIPLE-GROUP METHOD 


Tee MULTIPLE-GRouP method is a variation of the 
centroid method. It is of interest because, instead of extracting 
one factor at a time, several factors may be extracted simulta- 
neously. In this method some or all of the tests are placed in 
groups and the reference vectors are passed through the centroids 
of these groups instead of through the centroid of all the tests as 
is done in the complete centroid method. The only restriction 
on the groups is that they be linearly independent (see p. 201). 
Because of this restriction, the method is best used with variables 
about which sufficient is known so that independent groups can be 
selected. If the groups are not linearly independent, the method 
will yield imaginary numbers in one of the steps, and it will be 
necessary to start over with a new grouping of the tests. If groups 
of variables can be selected in advance that are similar in content 
to the factors to be obtained, the reference vectors obtained by 
group method will be closer to their final rotated 

87 


the multiple- 
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loadings and less rotation will be called for than in the centroid 
method. Thus, if one group were to consist of numerical tests, 
the vector passing through their centroid would be close to the 
final rotated position of the numerical factor. 

After several factors have been extracted simultaneously, a 
table of residuals may be computed and examined to determine 
whether it contains additional common-factor variance. If it is 
decided that the table of residuals contains additional factors, they 
may be extracted by the multiple-group method, centroid method 


or any other suitable method from the table of residual-correlation 
coefficients. 


Numerical Example. The correlations of the eleven variables 
in Table 6.2 which were factored previously by the centroid 
method will also be factored by the multiple-group method. 


A. Estimation of Communalities. In this method there is not 
an opportunity to re-estimate the communalities after the extrac 
tion of each factor. Therefore, it is desirable to obtain an im- 
proved estimate of the communality of each test at the start. This 
may be done by the centroid method of estimating the communal- 
ity according to the following steps: + 


1. For each test, make a 4 X 4 correlation matrix containing 
the variable itself and the three variables with which it correlates 
highest. This has been done for tests 1 and 2 in Table 6.1. The 
highest correlations in each column of the 4 X 4 matrix are in- 
serted in the diagonal cells. An estimate of the communality may 
be obtained from the formula 
Ed [271]? 

ET 


he (6.1) 


where žr, = sum of the values in the first column; 3T = sum of 
the values in the entire 4 X 4 table. Do this, in turn, for each 
variable. 

2. Enter the computed communality estimates in the diagonal 
cells of the correlation matrix. The estimates of the communall- 


1For other methods of estimating the communalities, see Thurstone (19476 
Chapter 13). 
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TABLE 6.1. COMPUTATION OF COMMUNALITY ESTI- 
MATES BY THE CENTROID FORMULA 


Test 1 


Test 2 


ties obtained by formula.6.1 have been inserted in the diagonal 


cells of Table 6.2. 

B. Grouping the Variables. 1. There are several methods 
by which the variables may be placed in groups for the extraction 
of factors. There should be, preferably, at least three or four 
variables in each group. One method of grouping might be to do 
a cluster analysis of correlations; or, if there is some a priori basis 
for groupings of variables, it can be employed. If the factor con- 
tent of the tests is known from previous studies, this information 
can be used to guide the grouping. The method is not dependent 
on the grouping, however, except for the restriction, mentioned 
above, that the groups be linearly independent. Also, the groups 
need not contain the same number of variables. 

In the example five groups were formed from the eleven tests 
and are indicated in Table 6.2. (In the example most groups 
consist of only two tests. It is not recommended that such small 
groups be used unless the tests and the way they can be grouped 
are well known to the investigator.) 

C. Obtaining the Oblique Factor Loadings. After the groups 
have been formed the steps are as follows: 1. Sum each column of 
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the correlation matrix by groups. Each column will give as many 
sums as there are groups. Place the sum for each group in the 
appropriate row below the correlation matrix. 

In the example, the first group consists of tests 1, 7, and 8. 
The sum of the correlations of these three variables with test 1 
is 1.59. This is placed in row G, in the column for test 1 of 
matrix S. 

The sum of variables 5 and 9 for group 2 is placed in line Ge 
in the column for variable 1. The sums for the other groups in 
each column are placed correspondingly in the proper row and 
column. This table of group sums by columns is labeled matrix S. 

2. Sum each row Gi, Gə, +++, etc., by groups, and place the 
results in an n X n matrix at the right (n represents the number 
of groups). This matrix is labeled “T.” 

In the example, the values for the first group (tests 1, 7, and 8) 
in row G; are summed (1.59 + 1.78 + 1.86 = 5.23). This value 
is placed in the first cell of row G, of matrix T. The values for 
the second group are likewise summed (.26 + —.17 = .09) and 
put in the second cell. The rest of the 5 x 5 matrix is obtained 
in a similar manner. 

3. Obtain the reciprocal of the square root of each value in 
the diagonal cells of matrix T. These are the weights which are 
applied to the group sums in matrix S to obtain the oblique factor 
loadings. The formula for the weights is 

W, : 6.2) 
oe (6. 
where d; is the value in the diagonal cell of matrix T in row j. 

In the example the value in the diagonal cell of row 1 is 5.23 
and its reciprocal, W; = .437269. The other weights are obtained 
similarly. 

4. Multiply each value in each row of matrix S by the weight 
for that row. Enter the results in a matrix V’ just below matrix S. 
In the example, the first value (.6953) is obtained by multiplying 
1.59 by W4. 


The values in the V’ matrix are the loadings of the variables 
on the reference vectors which were passed through centroids of 
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the groups. These groups, except in rare instances, are not or- 
thogonal to each other; in other words, the axes running through 
their centroids are oblique. The matrix V’ is the transpose of 
the matrix V, the matrix of oblique factor loadings. If the groups 
have been chosen carefully to represent meaningful clusters of 
tests or other variables and it is not desired to rotate the factors 
or change the final groupings, the analysis may be terminated at 
this point, after a table of residuals has been calculated and it is 
determined that all the common-factor variance has been ex- 
tracted. Or, the analysis may be continued through step 6, below, 
where the cosines of the angular separations of the oblique factor 
vectors are obtained. If, however, it is desired to rotate the factors 
they should be converted first to a reference frame of orthogonal 
factors by the following steps: * 

5. Multiply the values in each row of matrix T by the weight 
for that row and enter the results in matrix U. 

6. Multiply the values in each column of matrix U by the 
weight for the corresponding column (i.e., multiply the values in 
column 1 by W, in column 2 by Wz, etc.) and place the results 
in matrix R,,. The values in the diagonal cells should be within 
rounding-error of unity and the other values in matrix R,, are the 
correlations among the factors (i.e., the cosines of the angular 
separations of the unit-length reference vectors). 

7. In order to obtain a transformation matrix by means of 
which the loadings on the oblique factors can be transformed to 
loadings on an orthogonal system, the matrix Ry, is factored by the 
diagonal method with unity in the diagonal cells. (See pp. 52-58.) 
The results are shown in matrix F,,,. (This is the point at which 
the method usually breaks down because of imaginary numbers if 
the groups are not linearly independent.) 

8. The desired matrix which will transform the oblique to 
orthogonal reference axes is the transpose of the inverse of matrix 
F (i.e. the transpose of Em 

9. A procedure for obtaining the inverse of a matrix is de- 
scribed in Chapter 3 and may be applied here. Since, however, 
the matrix whose inverse is desired is a triangular matrix (one 


——— 
2¥For a method for going directly from the multiple-group loadings to the 
rotated oblique loadings (on the primary axes) see Harris and Schmid (1950). 
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which has zeros above the principal diagonal), its inverse may also 
be computed by a simpler method which will be described below. 


Calculation of the Inverse of a Triangular Matrix. It will 
be recalled from formula 3.1 that A-A-—1=[I. IfA is the tri- 
angular matrix with known values and A~? is the inverse which 
we wish to find, the relationship for a 5 X 5 matrix can be written 


as follows: 


tj © O © o X X2 %43 žia *5 1.0 0.0 0.0 0.0 0.0 
an % O O 0 Xa) X22 *23 X24 *25 0.0 1.0 00 00 00 
a34 a3 a3 O O Jej] x3 *32 x33 *34 *3s||=||00 0.0 1.0 0.0 0.0 
O41 %> Cag 144 o Xa %az Žas *aa *as 00 0.000. 1.0 0.0 
%1 Osa %53 Osa "55 *s1 *s2 *s3 “sa “55 00 00 00 0.0 1.0 
A at I 
f 
i| 1.0000 .0000 .0000 .0000 .0000 ny Xe ža xa xsl] || 10 0.0 00 oo oo 


.0294 .9996 .0000 .0000 .0000 
14922 .2023 .8466 .0009 .0000|!*|| x x39 x33 x34 *35||=|| 00 0.0 1.0 0.0 00 


157226218 -.0344 .5336 .0000 Xa Xap %ag Xaa %as|| || 0.0 0.0 0.0 1.0 0.0 
15032 6533. .3764 -.0671 4619 Xs so sy %sa %ss{| || 0.0 0.0 00 0.0 1.0 
Fa F.' I 


By means of the row-by-column rule of matrix multiplication, 
simple equations can be set up for the solution of the unknown 
values in the inverse matrix as follows: 


1. Row 1 X column 1 
aiı%11 + 0.0%21 + 0.0%31 + 0.0x41 + 0.0%51 = 1.0 
411x11 = 1.0 
1.0 _ 1.0 


x1 = =—=10 6.3 
a a1 1.0 ( ) 


2. Row 1 X column 2 
411¥12 + 0.0x22 + 0.0%32 + 0.0%42 + 0.0%52 = 0.0 
411%12 = 0.0 
#12 = 0.0 (6.4) 
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In the same way it can be shown that x13 = 0 = X14 = X15 and 
that all other values xj, (for j > k) to the right of the principal 
diagonal are equal to zero. 

8. Row 2 xX column 2 


az1%12 + a22%223 + 0.0x32 + 0.0%42 + 0.0x52 = 1.0 
i a21%12 + 422%22 = 1.0 
From equation 6.4 x12 = 0.0 and 
áz2%223 = 1.0 
or 
Hap = is = Si = 1.0004 (6.5) 


az 9996 


It can be shown in the same way that all other values in the 
principal diagonal of the inverse matrix (x;x) are the reciprocal 
of the corresponding values in the given matrix, and hence 


1 1 
ee ea, 6.6 
33 753-8466 ee 
1 1 
a TaD 6.7 
"ae Fa, D036 va 
1 1 
ie = ee 987 6.8 
X56 = aes, 41069 a 
and i 
24 (6.8a) 
Akk 


4. Row 2 x column 1 
+ 0.0x31 + 0.0¥41 + 0.0x51 = 0.0 


41X11 + G22%21 = 0.0 


21X11 + 422%21 


aati _ (.0294) (1.0) — —.0294 (6.9) 
.9996 


wei = 
422 


5. Row 3 X column 2 
431X192 + 432%22 + 433%32 + 0.0x42 + 0.0%52 = 0.0 


GgiXi2 + 432%22 + 433X32 = 0.0 
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In equation 6.4 it was shown that x12 = 0.0, hence 


a32%29 + 433X32 = 0.0 


and 
as2x22 (.2023) (1.0004) _ 


GES T ag 8466 

Similarly, it can be shown that, for x21, X32, X43 and for all 

other values of xz; (where k = j + 1), the value can be obtained 
by the equation 


—.2391 (6.10) 


La = = ae (6.11) 
Akk 
and hence 
443%33 (—.0344)(1.1812) 
x43 = mr 5336 .0761 (6.12) 
aaa (—.0671) (1.8741) 
X54 = radia H61 .3016 (6.13) 


6. Row 3 Xx column 1 
azıx11 + 432%21 + 433%31 + 0.0%41 + 0.0x5; = 0.0 
azı%11 + a32%21 + 433x31 = 0.0 


a31%11 + 4832%21 


Ss = 
433 


_ (4922)(1.0) + (:2023)(—.0294) 
-= Ae = —.5744 (6.14) 


It can be shown that the general formula for getting the value 
in any cell az; (where k > f) of the inverse matrix is: 


Xk = 


Onin, 418i 41.5 E Ak, i4242, j beth Ak, kikii (6.15) 


Akk 
Hence: 
442%22 + G43%32 
Cr an 
A44 
(.6218) (1.0004) + (—.0344)(—.3391) 
7 5336 


ll 


—1.1812 (6.16) 
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__  l53X33 F As4%43 
jes 2S U 


455 
(.3764) (1.1812) (—.0671) (.0671) 
4169 = —1.0542 (6.17) 
ap 441X11 + 442X21 + 443% 31 
41 
A44 

—_ (5722) (1.0) (.6218)(—.0294) (—.0344)(—.5744) 

= —1.0751 acai (6.18) 
a as2X22 + 453X32 + 25442 
52 = 

455 
N (.6533) (1.0004) (.3764) (—.2391) (—.0671)(—1.1812) 
4169 

= —1.5419 (6.19) 

Hones 45X11 + 452X21 + a53¥31 + 454841 
51 = 
a 
(.5032) (1.0) (.6533)(—.0294) | 

Z X (.3764)(—.5744)(—.0671)(— 1.1751) 

= 4169 

= —.8154 (6.20) 


In general, there are k minus j terms in the numerator of the 
equation for determining the value of xz; all values to the right 
of the principal diagonal are zero, the values in the principal 
diagonal of the inverse matrix are the reciprocals of the corre- 
sponding values in the principal diagonal cells of the given tri- 
angular matrix, and the other values are obtained by equations 
Similar to equation 6.15. The inverse matrix (Fpm—1) is therefore * 


1.0000 0.0 0.0 0.0 
-.0294 1.0004 0.0 0.0 
—.5744 —.23916 1.1812 0.0 
-1.0751 -1.1812 .0761 1.8741 


—.8154 -1.5419 -1.0542 -3016 
o] 
ee 


8 The accuracy of the calculations can be checked, of course, 


FomEpm—1 = I. 


0.0 
0.0 
0.0 
0.0 
2.3987 


by the equation 
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Since the matrix which will transform the oblique factors to 
orthogonal factors is the transpose of this matrix, it is written as 
follows: 


1.000 = .0294 — 5744 -1.0751 — 8154 

0.0 1.0004 —-—.2391 -1.1812 -1.5419 

0.0 0.0 1.1812 0761 -1.0542 

0.0 0.0 0.0 1.8741 -3016 

0.0 0.0 0.0 0.0 2.3987 
(ry 


The Orthogonal Factor Loadings. 1. To obtain the orthog- 
onal factor loading, multiply matrix V by matrix (F’mp)~? and 
enter the results in a matrix as in Table 6.3.* 


TABLE 6.3. ORTHOGONAL FACTOR LOADINGS (F) 
EXTRACTED BY THE MULTIPLE-GROUP METHOD 


Foctor 
1 | Dial and Table Reading... 21 61 
2 | Spatial Orientation I». .02 | .46 
3 | Reading Comprehension . 01 | .43 
4 | Instrument Comprehension, . .26 41 
5 | Mechanical Principles. s... .06 54 
6 | Speed of Identification.... -.02 |.45 
7 | Numerical OperotionsI.... -.01 64 
8 | Numerical Operations ... —.21 71 
9 | Mechanical Information.... -19 | .47 
10| Practical Judgment . PERT 3 ý = 01 239 
11| Complex Coordination se.. 25 |435 


2. To obtain a table of residuals use the orthogonal loadings 
in the usual way: 


Pik = fik — (ajar + Ajzlk2 *** jrQkr) 


The residual correlations after the extraction of the 5 factors 
in the example are shown in Table 6.4. They are, by inspection, 
sufficiently small so that no further factors need be extracted. 


4In matrix notation F = V(F'mp)—1. 
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T TE 6.4. RESIDUAL-CORRELATION MATRIX AFTER 
RACTION OF FIVE MULTIPLE-GROUP FACTORS 


' 23 4 &§ 6 7 8 GF 0 tt 
09 .01 .01 02 .00-.01 .02 06-03 7.01 .02 
01.02 -.01 -.01 .01 .01 .00-.01 -.02 01° .00 
01 -.01 -.04 .02 .01 .01 -.01 -.01 -.03 .03 -.02 
-02 -.01 .02 .00 .01 .00-.01 .03 .00 -.01 -.01 
00 .01 .01 .01 -.04 .00-.01 .01 .03 .00 .00 
01.01 .01 .00 .00-.01 .01 .02 .02 .00 .00 
02 .00 .01 -.01 -.01 .01 -.05 .02-.13 .00 .01 
06 -.01 -.01 .03 .01 .02 .02 -.08-.17 .01 ~.03 
9 |-03 -.02 -.03 .00 .03 .02-.13 -.17 -04 .03 00 
10-01 .01 .03 -.01 .00 .00 .00 .O! .03 -.02 02 
11].02 .00 -o2 -01 .00 .00 .01 -.03 .00 .02 _.00 


NOUR WN = 


œ 


variance remaining in the 
f additional factors, 
the multiple-group 
a table of 


I aan è 
s. there is sufficient common-factor 
die ual correlations to justify the obtaining 0 
woe mV be extracted in the same way by 

hod or b 6 

i any other method appropriate for 
residuais, 7 pprop 

3. i 
meth The orthogonal factor loadings may be rot 

ods to be explained in the next chapter. 


ated by the 


PRINCI > 
PAL-AXES METHOD 


The principal-axes method of factoring the cor- 


Yı . 
ag matrix is of interest for several reasons. Each factor 
of facto the maximum amount of variance (i.e. the sum of squares 
est la is maximized on each factor) and gives the small- 
the sear le residuals. The correlation matrix 3 condensed into 
method = number of orthogonal factors by this method. The 
east s also has the advantage of giving a mathematically unique 
et Juares) solution for a given table of correlations. 
ns X ractice this method has the limitation that the ana 
Centroj a laborious. Equivalent results can be obtained by the 
Extracte ane with less work if an additional factor or two 1s 
9 Talai, that the two solutions account for the same amount 
€. 


tio 


1935) has developed an itera- 


Name 
Umerical Example. Hotelling ( a 
al axes which 


tive 
met eons P n 
ethod of obtaining the loadings on the princip 
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can be carried to any desired degree of accuracy. As an example 
let us consider the fictitious 4 X 4 correlation matrix in Table 
6.5 with known true communalities entered in the diagonal cells. 


A. Iteration Procedure. 1. Enter the sum of each column of 
the correlation matrix (R) in the row labeled “3.” 

2. Divide each value in the row labeled 3; by the highest value 
in that row and enter the results in the next row, labeled “u.” 

3. Obtain the sum of the cross-products of row 3; with each 
row of the correlation matrix. Enter the results in the next row, 
labeled “32.” Thus the first entry of row X» is the sum of the 
cross-products of row 1l and row 3X, the second entry is the sum 
of the cross-products of row 2 and row 4%, etc. 

4. Divide each value in row Xə by the highest value in the 
row and enter the results in the next row labeled “us.” 

5. Compare the values in rows u, and us. If they agree to 
the desired degree of accuracy (in this example agreement to four 
decimal places will be required), the iteration is completed and 
the factor loadings may be computed. If the desired accuracy 
has not been obtained (as it has not in the example), the iteration 
is continued by the “squaring” process. 

6. Multiply the correlation matrix by itself, using the row-by- 
column rule of matrix multiplication. The result is the product 
matrix R? shown in Table 6.5. 

7. Sum the columns of matrix R? and enter the result in a row 
labeled >. As a check on the accuracy of the work to this point, 
the values in row 3 in matrix R and in row 3 in matrix R? 
should be the same. If they check, copy the values from row uz 
in matrix R into the next row in matrix R?. 

8. To obtain the expected values (33) for the columns of the 
next matrix (R4), sum the cross-products of row 3 with each row 
of matrix R? successively and enter the results in row 3X, of 
matrix R?. 

9. Divide each value in row 3; by the highest value in the row 
and enter the results in the next row, labeled us. 

10. Compare the values in rows us and us. If they agree to 
the desired degree of accuracy, the iteration is completed. If not, 
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TABLE 6.5. PRINCIPAL-AXES METHOD CALCULATIONS 


Correlation Motrix (R) Matrix R? 
Var. 1 2 3 4 1 2 3 4 
1 | (90) 60 48 .69 1 1.8765 1.4445 1.2330 1.6515 
2 | 60 (50) 44 .57 1.4445 1.1285 .9690 1.2895 
3 | .48 44 (40) .50 1.2330 -9690 .8340 1.1070 
4 


69 57 .50  (.65) 
Uy] 2.67 2n 1.82 2.41 
u, | 1.00 -79 -68 -90 
2| 6.2055 4.8315 4.1430 5.5215 
ua | 1.0000 .7786 .6676 .8898 


1.6515 
6.2055 


1.2895 
4.8315 


1.0000 -7786 
32.85079875 25.55073375 21.90066750 29.20083375 
1.00000000 .77778120 .66667078  .88889265 


4.1430 5.5215 
-6676 -8898 


Motrix 


Ri 


1 9.85557375 7.66510875 


7.66510875 596186375 


8.76013875 6.81355375 


6.56997750 8.76013875 
5.11020750 6.81355375 
4.38025500 5.84022750 
5.84022750 7.78691375 


2 

3 656997750  5.11020750 
4 

E 


a | 32.85079875 25.55073375 
u3 | 1.00000000 .77778120 
E4 | 91930287064 715.01334389 
1.00000000 .77777778 
2.29999999  1.78888887 
90000000 —.70000000 


21,90066750 2920083375 
66667078 88889265 
612.86858049 817.15810730 
-66666667 - 88888889 
1,53333332 2.04444443 
60000000 -80000000 


First Residual- Correlation Matrix 


-.03 .01 02 .O1 
100 -.33 -.67 -.33 


Ea | .1500 -.0500 -.1000 -.0500 
U2 | 1.0000 -.3333 -.6667 -.3333 
vjz] -1500 -.0500 -. 1000 -.0500 
2 -3000 -. 1000 -. 2000 -. 1000 


Eur =.2500 
JEuy =.50 
1/ Evz =2.00 


Zu,v =6.53086411 
VEU y =2.55555554 


1WEu y= -39130435 


Factor Loadings 
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the process must be continued with R#, RS, etc., until two succes- 
sive rows of u agree to the specified number of decimal places. 

In the example it was necessary to obtain matrix R‘ and to 
obtain the expected values 3, and u4. The values in rows us and 
u, agree to four decimal places and the iteration was, therefore, 


terminated. 


B. Computation of the First Factor Loadings. 1. To compute 
the loadings on the first factor, it is necessary to obtain the sum 
of the cross-products of each row of the correlation matrix, R, and 
the final row of u (us in the example). The results are entered 
in the next row labeled vj. 


vj, = Ra (6.21) 
2. The formula for obtaining the loading for variable j on 


the first factor is 


m 
a = vV Xuv (6.22) 


The quantity Suv is the sum of the cross-products of the final row 
u and row vj. In the example, Siv; = 6.53086411; Vus = 
2.55555554, and the reciprocal, 1/.\/Suguj, = .39130435. 

3. The factor loadings are obtained by multiplying the value 
of vj for variable j by 1/-V Rusu. 


C. Second and Succeeding Factors. 1. A table of residuals is 
obtained from the correlation matrix and first factor loadings by 
the same method as was used in the case of the correlation matrix 
and first centroid loadings (see p. 73). The formula is 


1Pjk = Vik — jiaki 


where ip denotes the first residuals. The table of first residuals 
for the example is shown in Table 6.5. 

2. The communalities may be re-estimated at this point, and 
the process that was used for extracting factor I is repeated to 
obtain factor II from the first table of residuals. 
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3. In the example (Table 6.5) the values in row u, and row 
Uz are identical, and it is not necessary to apply the squaring 
Process to obtain the second factor loadings. 

4. Compute vj by the formula 


Uja = 1p (6.23) 


In the example in Table 6.5, multiply each row of the first resid- 
ual matrix by us and enter the results in the next row, labeled vj. 
5. To obtain the loadings on the second factor first compute 


Luedvj2 = .2500 


The formula for the second factor loadings is 


Uj2 
oe Se 6.24 
lja PRT (6.24) 


The values are entered in the last row of the first residual 
matrix and in the factor matrix in Table 6.5. 

6. To compute the next and succeeding factors the second 
table of residuals is obtained, the communalities are re-estimated, 
and the process used in calculating the first factor is repeated. 
This is continued until the residuals are judged to be negligibly 
small. In the example the second table of residuals contains all 
zeros, and there are just two factors. 


In the principal-axes solution the sum of the cross-products 
of any pair of columns in the unrotated factor matrix is equal to 
zero, i 
When the sums of the squares of loadings for different factor 
columns (3a;,°) are unequal, successive values in the row u will 
Converge to the desired degree of accuracy in relatively few trials. 

f however, the sums of the squares for columns with the larger 
sums are nearly equal, convergence will be slow, and it may take 
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many trial values of u. These values (Xa;*), called the latent 
roots, are, of course, not known in advance, and in an actual 
problem, obtaining the principal axes may be laborious. 

After the principal-axes factors have been determined with 
known or estimated communalities in the diagonal cells, they may 
be rotated to psychologically meaningful positions according to 
the principles described in the next chapter. Another use to 
which this method has been put is factoring the matrix with either 
unities or reliabilities in the principal diagonal. The resulting 
factors are called “principal components” and are used to repro- 
duce the score matrix rather than the correlation matrix. The 
number of components extracted is equal to the number of tests 
in the battery (Hotelling, 1933b). 


OTHER METHODS OF EXTRACTION OF FACTORS 


The methods for extracting factors presented in 
this chapter are but a few of the many methods which have been 
developed. Several of the widely used or theoretically important 
methods not discussed in this introduction are group-centroid 
method (Thurstone, 1947c), the simple-summation method (Burt, 
1917), the maximum-likelihood method (Lawley, 1951). The 
group-centroid method might be said to lie between the multiple- 
group and complete centroid methods described above, and, al- 
though it is computationally simpler than the latter, it is probably 
not as effective. Burt’s simple summation method is very similar 
to the centroid method. Lawley’s maximum-likelihood method 
which is seldom used because it is computationally laborious gives 
the most mathematically “efficient” estimates of the factor loadings 
and has the added considerable virtue of providing a chi-square 
test for the number of significant factors. As high-speed comput- 
ing machines are more widely used, more exact methods such as 
the maximum-likelihood method are likely to gain wider usage- 

It can be shown that the results from the different methods 
are mathematically equivalent and that the loadings derived by 
one method can be transformed into those derived by any other 
method (see Wolfle, 1940). 
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PROBLEMS 


l. Factor the correlation matrix on page 17 by the multiple-group 
method. Use the results from the cluster analysis in Problem 1 of 
Chapter 2 to set up the groups. Estimate the communalities by the 
centroid-formula method. 

2. Make up a4 X 4 matrix consisting of the correlations of tests 3, 
4, 7, and 8 in the correlation matrix on page 17. Factor the 4x4 
matrix by the principal-axes method, using the communality estimates 
for these tests that were obtained in Problem 1 above. 


CHAPTER / 


Methods of Rotation: 
Orthogonal Axes 


F A given matrix of correlations, each of the 
factoring methods presented in the preceding chapter will arbi- 
trarily locate the reference axes in a different position. In order 
to move the axes from the arbitrary location determined by the 
method of extraction to some position useful for interpretation 
of the factors and for comparison with other studies, the axes are 
rotated. A major goal of rotation is to obtain meaningful factors 
that are as consistent (or invariant) as possible from analysis to 
analysis. The “proper” location of the reference frame depends 
upon the purposes and theoretical approach of the factor analyst.* 

Rotation is the aspect of factor analysis which is the most diffi- 
cult to teach_and the most difficult to learn, because it requires 
considerable background, insight, and skill. Only after rotating 
in several problems in a given area can one expect to have con- 
fidence in his ability to find satisfactory rotational solutions. In 
other words, while the mechanics of rotation are mathematical 
and relatively straightforward, criteria for determining where to 


1For a witty exposition of this point see Cureton (1939). Reprints of this 
humorous article are usually available, on request, from the World Book Co., 
Yonkers-on-the-Hudson, New York. 
106 
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rotate are largely in the nature of interpretations of the data and 
are based on theoretical and conceptual considerations. The loca- 
tion of the reference vectors will vary, therefore, with the pur- 
Poses of the investigation and the theoretical biases of the investi- 
Sator, as indeed is true of the interpretations of most scientific 
Investigations. 

In some instances the configuration of the test vectors makes 
the location of the reference vectors, within certain limits, fairly 
obvious. Thus, Figure 7.la presents a configuration of four test 
vectors in two dimensions. Where should the reference vector or 
vectors be placed? One possible solution might be to run a single 
reference vector through the configuration in such a location as 
to maximize the sum of the squares of the loadings as in Figure 

:lb. This would be the first principal axis and all four variables 
would have high loadings on it. If it were not rotated, it would 
be a general factor and would account for a great deal of the 
Variance that the four variables have in common. If a second 
factor (II) were extracted, orthogonal to the first, we have the 
Situation diagramed in Figure 7.lc. These two factors could be 
kept orthogonal to each other and rotated to the position shown 
in Figure 7.1d, or two planes, oblique to one another, could be 
Passed through the centroids of clusters of highly correlated tests, 
SO as to maximize the number of zero or near-zero loadings, and 
the factor axes (called normals to the planes) placed perpendicular 
to the planes at the origin as shown in Figure 7.le. As can be seen 
from the diagram, the oblique solution is more complicated than 
the others, It will be discussed in more detail in the next chapter. 

There are, of course, an infinite number of locations of orthog- 
Onal or oblique axes, but those mentioned above are some of the 
More common ones used to account for variance common to clus- 
ters of tests, Spearman’s general factor, in the case where there is 
a unidimensional matrix (i.e., of rank 1) is represented by Figure 
7I, where the departures of the variable vectors from the refer- 
Ence vector and from each other are no-greater than can be at- 
tributed to random sampling fluctuation. 

If the configuration of test vectors in Figure 7.1f represents a 
departure from a single dimension that is greater than chance, 
SO that two dimensions are required to account for the interrela- 
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Ficure 7.1 Locations for reference axes. 
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tionships, the G-vector might be passed through test 1 and a group- 
factor vector, orthogonal to G, might be used to represent that 
portion of the common variances of tests 3 and 4 not accounted 
for by G, as is done in the Holzinger bi-factor solution. Spear- 
man’s followers might also accept this type of location of the 
reference axes in configurations of two or more dimensions. 

Thurstone and others who favor the multiple-factor approach 
find locations of axes such as in Figures 7.1d or 7.le more useful. 
They use orthogonal or oblique reference axes, depending upon 
which seem to fit the configuration better. A number of factor 
analysts, however, prefer the orthogonal solution, wherever it can 
be justified because of its greater simplicity and because it postu- 
lates independent reference variables. 

Often in setting up a factorial study the variables can be 
chosen so that a given configuration is anticipated. When this is 
done, there is some basis for the location of the rotated reference 
axes, Thurstone has developed criteria for setting up a factorial 
investigation so that a relatively unique configuration may be 
expected and a relatively standard location for the reference frame 
can be established.? These are known as the criteria of “simple 
structure” and “positive manifold.” 


SIMPLE STRUCTURE AND Posrrive MANIFOLD 

These criteria will be illustrated with reference to 
a three-dimensional problem. Figure 7.2 shows 11 test vectors. 
All but test 11 have been selected so that their vectors lie in one 
Of three planes (A, B, and C) which are orthogonal to one another. 
if three orthogonal reference vectors (vector 
1, perpendicular to plane A, etc.) are placed at the intersections 
of these planes, no variable in a plane will be loaded on more 
than 2 factors, A vector which coincides with the intersection of 
two planes will have a loading on that reference vector only and 
is referred to as a “pure” measure of the factor. Variable 2, for 
Instance, has loadings (projections) on reference vectors I and II 
but none on III. Vector 4 has a loading on II but none on I 
and III, since it is perpendicular to them. An eleventh variable 


es 
2 Thurstone (1947c, Chapter TX). 


In this arrangement, 
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is shown in Figurép.2 that is not in any of the planes but it is in 
the space bounded by them. It has loadings on all three of the 
factors, but it would not help to determine the positions of the 
planes and reference vectors. 

Where several vectors lie in a plane (or for more than three 
dimensional configurations, a hyperplane) and determine its loca- 
tion, there is a relatively 
unique location for the refer- 
ence vectors at the intersec- 
tions of these planes (or hyper- 
planes). Thurstone (1947c, 
p. 335) has set up specifica- 
tions for simple structure, for 
the case where each variable is 
less complex than the battery 
as a whole. They can be 
stated as follows: 


0 


l. Each variable should 
have at least one loading close 


Ficure 7.2 Illustration of simple struc- tO Zero. 


ture and positive manifold in a three- 9 7 for 
dimensional configuration. 2. There should be, 


each factor column, at least as 
many tests with zero loadings as there are factors. (Thus in the 
three-dimensional, orthogonal illustration each reference vector 
should have at least three variable vectors orthogonal to it.) 

8. For every pair of factors there should be several variables 
with projections on one factor vector but not on the other. 

4. For problems having four or more factors, a large propor 
tion of the variables should have negligible Joadings on any pair 
of factors. ` 

5. Only a small number of variables should have appreciable 
loadings on any pair of factors. 


The essential aspects of simple structure are that each variable 
contains fewer factors than the configuration as a whole, that each 
variable contains a different combination of factors, and that the 
location of each plane (or hyperplane) is determined by several 
test vectors. 

Positive manifold exists when the axes can be rotated so that 
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all loadings are positive or zero. In practice, small negative load- 
ings are tolerated. This principle is especially useful in the area 
of abilities, whose measures, generally speaking, are positively 
correlated. In dealing with temperament traits, both positive 
and negative loadings occur, and the restriction of positive mani- 
fold does not apply. In that case 
bipolar factors frequently occur, 
but the criteria for simple struc- 
ture can still be applied. 

Thurstone (1947c, Chapter 
IX) has distinguished a number 
of configurations of variable vec- 
tors which, together with their 
factor patterns, may be repre- 
sented for the three-dimensional 
orthogonal case by means of 
Spherical right triangles. 

Figure 7.3 shows how such a FIGURE 7.8 Showing the algebraic 
spherical right triangle is drawn $805, g, Senich would also be the signs 


on the surface of a sphere. The of the three factor loadings of a vari- 
algebraic signs indicate the gi. able Whose vector i located in the 


rection of the loadings on the 
three factors. Thus a variable vector ending on the line between 


the ends of axes I and III would have positive loadings on factors 
I and III and a zero loading on factor II. 
Below are some typical configurations. 


l. Isolated constellations. 
Factor Pottern 


1 
2 
3 
4 
5 
6 
z 
8 
9 
10 


e of simple structure to recognize. The 
lated and have very low 
clusters. Each vari- 


This is the easiest typ 
Variables in each cluster are highly corre 
correlations with the variables in the other 
able is a relatively pure measure of its factor. 
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This same configuration of variables will yield different factor 
patterns if the reference axes are rotated to a position other than 
simple structure. These other patterns, such as would be ob- 
tained by putting axis I through the centroid of all of the vectors, 
although preferred by some investigators, are less simple and seem 


fess useful. 
+9 Foctor Pattern 
Test| I 


ETUE ETSSI 


SCMVaAUAWNH— 
XK KK KK OC 
rrrixxx 


Foctor Pattern 


SGeC®VOUSWNH 


This is similar to the configuration pictured in Figure 7.2. 
The tests in the corners at the intersections of the planes are of 
complexity one. All other tests are complexity two. This is an 
example of simple structure and positive manifold. 

3. Simple structure without positive manifold, due to the 
presence of bipolar factors. 


10 I Factor Pattern 


1 
2 
3 
4 
5. 
6 
7 
8 
9 
10 
1 
12 
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4. Incomplete triangular configuration. All tests have com- 
plexity of two except those that lie at the intersections. 


Foctor Pattern 


1 

2 
3 
4 
5 
6 
Fi 
8 
9 


In this configuration there is one pure measure of each of the 
factors and other variables which are measures of combinations 
of factors I and II, and I and III, but none of the variables are 
measures of combinations of factors II and III. In such a situa- 
tion it would be of interest to try to devise a measure which com- 
bined factors II and III. If the attempt were successful, it would 
be further confirmation that the factors had been meaningfully 
interpreted. 

Other more complex configurations frequently occur in ex- 
perimental data. If possible, the factorial experiment should be 
designed with some definite configuration in mind so that simple 
Structure or other principles can be applied to guide the rotations. 
Tt should be noted, moreover, that alternative simple structures 
can sometimes be located in experimental data that do not have a 
clear-cut structure. Repeated studies in the area or theoretical 
Considerations should help to determine which is the most useful 
or meaningful. 

A number of other criteria for determining the rotated posi- 
tions of factors have been discussed by Cattell (1944c). Briefly 
Summarized they are as follows: 

l. Rotation to agree with clinical and general psychological 
findings. Passing the axes through clusters of variables or syn- 
dromes known to exist from clinical or psychological theory and 


Observation. 

2. Rotation to agree with factors from past factor analyses. 
This principle would be especially applicable in fields and for 
Populations with well-established factors and well-known measures 


of the factors. 
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3. Rotation to put axes through the center of clusters. The 
axes may be passed through either the outstanding correlation 
clusters in the correlation matrix or through the clusters which 
appear on the plots of factors represented two at a time. 

4. Rotation to agree with successively established orthogonal 
factors. If there is only one new factor in a matrix of r factors, 
it can be placed orthogonal to the first r — 1 known factors. In 
this way starting with one factor of known position and succes- 
sively adding one factor at a time, a large number of factors could 
be rotated into position. This principle, of course, has the limita- 
tion of applying to orthogonal factors only. 

5. Rotation to produce loading-profiles congruent with gen- 
eral psychological expectations. Factors representing traits ac- 
quired by education or training might be expected to have either 
very high or very low loadings, while factors representing more 
basic, constitutional traits might be expected to have a more even 
distribution of loadings. If these two kinds of factors are present 
in a study, the rotations might be guided partly by this principle 
so as to bring about the maximum number of factors with the 
expected profiles of loadings. 

6. Rotation to produce parallel proportional profiles. ‘The 
purpose of this criterion is to extend the principle of parsimony 
over a wide range of studies rather than restricting it to one study 
at a time, as is true of the simple-structure principle. For this 
reason Cattell has given it the alternate title of “simultaneous 
simple structure.” In considering a large number of studies, if 
there are fundamental, unitary factors common to them, the rota- 
tions should be done so as to get maximum agreement among all 
of the studies. If the studies differ in their experimental condi- 
tions or the populations to which they are applied, the agreement 
of the profiles of factor loadings would be expected to be pro- 
portional rather than absolute. In this way agreement could be 
obtained over a wide area of application and a wide variety of 
conditions. 


GRAPHICAL ROTATION 

One of the most convenient methods of rotation 15 
to plot pairs of orthogonal axes and rotate them graphically. To 
get a complete picture of the structure at any point in the rotation 
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process by this method requires the plotting of all combinations 
of pairs of axes. For r factors this would require 147(r — 1) plots. 

After a pair of axes has been rotated, the rotated numerical 
values can be read from the graph and the new values plotted. 
Te much rotation is to be done, it is advantageous to construct a 
simple apparatus designed by Zimmerman (1946) which requires 


al axes, showing how point 2 


Ficure 7.4 Graphical method for rotation of orthogon: an die) 


can be plotted directly from rotated axes I, and IV,. (See Zimmerm: 


a T-square with a special attachment and a triangle. This equip- 
ment eliminates the steps of converting the projections to numer- 
ical values after each rotation and the necessity of replotting the 
Numerical values. Figure 7.4 shows how to set up and plot a point 
from a pair of rotated axes using this equipment. 


Rotation of the Eleven-variable Example.’ The first step is 
to plot the loadings of each centroid factor to some convenient 


ae 
SIE a positive manifold or something close to one is expected, because a xi 
the correlations are positive, Or if negatives are few and low, a first step mig) be 
to make a preliminary algebraic rotation by means of a Tangan en are 
Matrix. Jf relati re positive, this has the effect of inclining a 
all of the correlations are p ; has | ts z 4 
the rotated axes at 45° to the first centroid. This yields positive manifold with 
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scale on a separate piece of graph paper. If graph paper with 10 
divisions to the inch is available, a scale of .10 to the half-inch (i.e., 
10 inches for the range from +1.0 to —1.0) is usually convenient. 
The next step is to plot the factors, two at a time, for rota- 
tion. It is customary to let the horizontal axis represent the factor 
with the lower number (e.g., if factors III and VII are being 
plotted together, factor III would be put on the horizontal axis 
and factor VII on the vertical axis). This arrangement gives a 
uniform meaning to clockwise and counterclockwise rotation. 


Rotation 1. Figure 7.5 shows centroid factors I, and II, from 
Table 5.18 plotted together on a two-dimensional graph. ‘These 
two axes were selected for rota- 
tion because the positive end of 
factor II, contains two tests with 
mechanical content (Test 5, 
Mechanical Principles, and Test 
9, Mechanical Information) and 
the negative end contains three 
“numerical” tests (Test 7, Nu- 
merical Operations I, Test 8, 
Numerical Operations II, and 
Test 1, Dial and Table Read- 
ing). Rotating axis II, with axis 
I, which has high positive load- 
ings for all the tests, in a counterclockwise direction, enables us tO 
put these two groups of tests on orthogonal (independent) factors 
and thus come closer to one of the requirements of simple struc 
ture (i.e., having tests that have high loadings on one factor and 
near zero loadings on the other) and positive manifold. In label- 
ing the rotated axes, what was formerly the negative end of axis u 
has been labeled as the positive end. This reflection of axis II 1s 
desirable because it results in most of the loadings on factor II be- 
ing positive. In contrast to reflecting a test, reflecting a factor, it 
will be recalled, has no effect on the signs of the correlations. 


-1.0 


-1.0 


FIGURE 7.5 


one rotation. Landahl (1938) gives the transformation matrices for three to uim 
factors and the simple rules for constructing these matrices of any rank. It a: 
still necessary, of course, to rotate the factors to simple structure or any othe 
position desired for interpretation. 
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Rotation 2. In looking for the next two factors to rotate to- 
gether it was decided to pair centroid factor III (III,) with rotated 
factor II (Ii),* as shown in Figure 7.6, because this would give a 
Separation of the numerical tests (7, 8, and 1) from two tests which 
seem to involve perceptual tasks (2 and 6). 

In general, for analyses in which positive manifold is expected 
(because the correlation matrix contains no significant negatives), 
an advantageous configuration for rotation exists when two groups 


Plot 2 10 


Plot3 1.0 


=1.0 


=1.0 


FIGURE 7.6 FIGURE 7.7 


Of tests, which are separated (i.e., one group positively loaded and 
the other group negatively loaded) on one factor, have positive 
loadings on the other. The group of tests which has positive 
saturations on both factors will be plotted in the ++ quadrant 
of the graph and the other group in the +— or —+ quadrant. 
(Sce Figures 7.6.) In the former case a clockwise rotation would 
be indicated and in the latter a counterclockwise rotation. 


Rotation 3. Rotated factor III (III) and centroid factor IV 
(IV,) were plotted together and rotated next as in Figure 7.7 for 


the following reasons: 
ee ee 


‘In Working with graphs it is well to cross off all but the last rotated position 
pe each axis. If this is not done it is a common error for the novice to plot the 
ae on a previous position of the axis (e.g., in this case IIe) and then wonder 
Why his communalities, etc., do not check. Another device for avoiding this error 


L . . 
5 to keep a list of the rotations and to check it cach time to make sure that the 


la i k 3 
test rotated position of an axis is being plotted. 
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1. The loadings of the perceptual tests (2 and 6) are increased 
on factor III. 

2. The number of zero and near-zero loadings on factor IV is 
maximized. 

3. The separation between the two groups of tests (group 1 
consisting of 5, 7, 8, and 9 and group 2 consisting of 3 and 10) is 
increased so that these groups can be separated still better when 
paired with another factor in which both groups are positive in 
a future rotation to positive manifold. 


Rotation 4. Figure 7.8a indicates how two previously rotated 
factors (I, and IV;) are plotted. The purpose of this rotation is 
to split the mechanical tests (5 and 9) from Test 3, Reading Com- 
prehension, and Test 10, Practical Judgment, both verbal in con- 
tent, on factor IV. In this case it is seen that the two groups of 
tests are less than 90 degrees apart. This situation might call for 
oblique rather than orthogonal axes. Since, however, for the 
present solution orthogonal reference axes have been specified, 
we will rotate so that the axes are approximately equidistant from 
the two groups.° 

Rotations 5 through 12 are also shown in Figure 7.8. Plot 5 
(Figure 7.8b) was made to separate the verbal tests (3 and 10) from 
the spatial tests (11, 4, and 1). Plot 6 (Figure 7.8c) was made to 
improve the loadings of the verbal tests (3 and 10) on factor I, to 
split the mechanical tests (5 and 9) from the other tests on factor 
II, and to maximize the number of near-zero loadings. 

Rotation 7 (Figure 7.8d) was made to maximize the number 
of zero and near-zero loadings and to increase the loadings of the 
numerical and mechanical tests on their respective factors by 
putting the axes more nearly through them. These two factors 
in an oblique representation would be slightly negatively related 
as might be expected, since numerical and mechanical tests tend 
to have low negative correlations (cf. Table 5.10; fst = —.02, 
Ti = SMA and Ts9 = —.06). P 

In rotation 8 (Figure 7.8e) slightly correlated axes are indi- 
cated. Since orthogonal axes are being used, however, the rota- 
tion indicated was made to improve the positive manifold. Like- 


5 See Chapter 8 for an oblique solution. 
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wise, in rotation 9 (Figure 7.8f), oblique axes are indicated. The 
rotation shown is a compromise to make the two clusters (spatial 
tests 4, 11, and 1, and perceptual tests 2 and 6) about equidistant 
from their respective axes. 

The mechanical (tests 5 and 9) and verbal (tests 3 and 10) 
clusters in rotation 10 (Figure 7.8g) are also correlated, and the 


rotation was made for the same reasons as given for rotation 9. 
Plot 7 


Plot 4 Plot 5 Plot 6 


Ficure 7.8 


Rotation 11 (Figure 7.82) is a slight adjustment to improve the 
Positive manifold. 

A complete picture of the locations of the rotated factors could 
be obtained by plotting all combinations of pairs of the latest 
Totated positions of the factors. For five factors 10 plots would 
required. Examination of these graphs should indicate 
whether or not further rotations would improve the simple struc- 
‘ure or positive manifold. i 
_ After it has been determined that no further rotations are 
indicated, the rotated values can be read directly from the graphs, 
if the plotting and rotating have been done accurately, by placing 
* Piece of graph paper marked with the scale to which the load- 
gs were plotted under the final rotated plots and reading off 
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the values. If this is to be done, it is well to do the plotting and 
rotating on onion skin or other transparent paper. 


The accuracy of the results can be checked by comparing /i’'s 


and reproduced r’s obtained from the rotated loadings with those 
which have been previously obtained from the centroid loadings. 
The rotated loadings, as read from the graphs, are shown in Table 
7.1 and the communalities and reproduced r's obtained from the 


TABLE 7.1. 


ZS eeanoauvn wn - 


rotated loadings are compared with those obtained from th 


GRAPHICAL ROTATIONS 


. Dial and Table Reading... 
. Spatial Orientation I... 


. Reading Comprehension , 
. Instrument Comprehension 
. Mechanical Principles... 
. Speed of Identification. , 
. Numerical Operations I . 
. Numerical Operations I . 
. Mechanical Information , 
. Practical Judgment .... 
. Complex Coordination .. 


FACTOR LOADINGS AFTER ELEVEN 


rea r] leet 

i pg v T Loadings 
.22 | .55 | .34 | 0 | .39 | .62 62 
.06 | .15 | .65 | .02 | .20 | .49 49 
154 | <29) |07 | <22: | 24 47 47 
NG | AS | cee | sal 46 | .40 40 
.20 |-.02 | .07 | .66 | .26 | .55 55 
-07 | 10 | .63 | .10 | 15 | .44 44 
-08 | .79 | .09 |-.10 | .13 | .67 67 
16 | .81 | .14 | 02 | Da | .70 -70 
¿17 [-11 |.01 | .61 | 06 | .42 42 
SA 41 | .07 | .27 | 15 | .40 40 
.04 | .10 | .29 | .22 | .47 | .36 36 


ODN OUKRWN 


5 


mov anaurun = 


Corr.Reproduced| Corr. Reproduced 
from Centroid 
Loadings 


from Rotated 
Loadings 


-40 
18 
31 
.37 
A7 
As 
-67 
-.05 
«25 
19 
.34 


e cen- 


troid loadings (originally given in Table 5.18). The communali- 
ties all agree to two decimal places, but there are some discrep” 


Methods of Rotation: Orthogonal Axes 121 


ancies of 0.01 in the two sets of values for the reproduced correla- 
tion coefficients. A more complete check would be to reproduce 
the entire correlation matrix rather than only those correlation 
coefficients immediately above the principal diagonal. 


CALCULATING THE TRANSFORMATION MATRIX 

If the rotated loadings obtained graphically are 
not sufficiently accurate, more accurate loadings can be calculated 
by means of a transformation matrix. The procedure for setting 
up a transformation matrix for the eleven-variable problem is 
illustrated in Tables 7.2 through 7.8. The transformation matrix 


TABLE 7.2. ROTATION 1 


Matrix Ay Matrix S, 


Y 
1.00 .95 0 0 0 
-95 -1.00 0 0 0 
0 1.00 «23 


Matrix Lyp= Ay Sy2 
I I 0 w 


Y 

1.00 95 (o) 0 o 
-95 -1.00 0 o o] 
o 0 1.00.23 0 


1 

1 

m 

W| o o -233 10 0 

ZI ğ o o o 1.00 
re 1.9025 1.9025 1.0529 1.0529 1.0000 
VE} 43793 13793 1.0261 1.0261 1.0000 
2 7250 .7250 .9746 .9746 1.0000 


.7250 .6888 0 
6888 -7250 0 0 

o O 9746 2242 0 
o O -2242 .9746 0 
0 o o 0 1.0001 


(represented by the Greek letter An) in Table 7.2 is an identity 
matrix, indicating that at the beginning no rotations have been 
made. Matrix As, the transformation matrix which represents 


is 
SIE the first rotation were a Landahl transformation, as suggested in the note 


on pages 115-1 16,A,, would be the Landahl matrix rather than the identity matrix. 
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TABLE 7.3. ROTATION 2 


Motrix Ayo Matrix S23 
I I = ww 4 
I 7250 6888 0 0 0 
a | .6888 -7250 0 0 o 
u| o O 9746 2242 0 
w| o O -.2242 9746 0 
y| o 0 0 0 1.0000 
Matrix L = Ag Sa 
I 1 I x Y 
.7250 .6888 .4959 .5220 o 
6888 -.7250 -.5220 .4959 0 
-1614 -7017 9746 .2242 0 
-7017 1614 -.2242 .9746 0 
o o0 0 0 1.0000 
4.5185 1.5185 1.5185 1.5185 1.0000 
1.2323 1.2323 1.2323 1.2323 1.0000 RANA Ms 
«8115 .8115 .8115 .8115 1.0000 I I pig E 


x 
I .5883 .5590 .4024 .4236 0 
I | .5590 -.5883 -.4236 .4024 0 
m | -.1310 .5694 .7909 .1819 0 
I | -.5694 1310 —1819 .7909 O 
y 1.0 


0 0 0 o 


the rotation of the axes from position 1 to position 2, is obtained 
by the following steps: 


l. Obtain matrix Sı». This is done by selecting plots so that 
none of the factors appears more than once. Plot 1 (Figure 7.5) 
was selected because it contains axes I, and II,. Plot 3 (Figure 
7.7) was selected because it contains axes III, and IV,.7 Plot 2 was 
passed over temporarily because it contains axis II, which is al- 
ready represented for this algebraic rotation in plot 1. The pt 
jection of each rotated axis on the unrotated axes is determine 
from the plot and entered in the proper cell in matrix S12. Thus 


7 II; may be taken before IIIo. The order in which the rotations are taken 
will not affect the final rotated positions of the axes. d 

It might be pointed out here that the order in which the axes are listed 2D 
numbered in the factor matrix is also arbitrary and any two columns may ; é 
interchanged. Thus if the rotation in plot 1 had been considered as clockw15 
rather than counterclockwise the factor labeled II would have been labeled I, an 
vice versa. 
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TABLE 7.4. ROTATION 3 


Matrix Ay; Matrix S34 


0 u .N Y 1, = my, F 
I 5883. .5590 .4024 .4236 0 [o] oO o 64 
m | 5590 -.5883 -.4236 .4024 0 100 0 -34 0 
m | -1310 .5694 .7909 .1819 0 o 100 0 0 
W | -.5694 1310 -.1819 .7909 0 34 o 100 0 
z 0.0 o o o 4.00 
Matrix Lis =A S34 Motrix D4 
I I a Eu Y 
5883 .7030 .4024 .2335 .3765 I | .8422 
15590 -.4515 -.4236 .6024 .3576 1 9468 
1310 -5076 .7909 .3755 -.0838 pug 1.0000 
=5694 .3999 -1819 .7464 -.3644 w -9468 
i -6400 0 0 O 1.0000 t4 -8423 
EL? | 1.4096 1.1156 1.0000 1.1155 1.4094 
1.1873 1.0562 10000 1.0562 1.1872 Maia Ag 
8422 9468 1.0000 .9468 .8423 


.4955 .6656 .4024 .2211 .3171 | .9999 
.4708 -.4275 -.4236 .5704 .3011| .9999 
-.1103 -.4806 .7909 .3555 -.0706) 1.0000 
-.4795 .3786 -.1819 .7067 -.3069) 1.0000 
~.5390 0 0 0 8423} 1.0000 


HRA 4 


the projection of I, on I, in rotation 1 (Figure 7.5) is +1.00 and 
has been so entered. The projection of I, on II, is +.95. The 
unrotated axes are listed down the side and the rotated axes across 
the top of matrix Sı» The projection of II; on II, is —1.00 (the 
Negative sign being due to the fact that axis II was reflected in 
this rotation). The projection of I; on Ie is +0.95. The projec- 
tons from plot 3 are similarly obtained and entered on matrix 
Sie, Matrix Sy. is always a symmetrical matrix (for orthogonal 
Totation) with plus or minus one in the diagonal cells. 

2. Obtain matrix Liz from the product AiSi2. Since Ai is 
an identity matrix, Liz and S12 are identical matrices for the first 
rotation, This is the transformation matrix in terms of “long” 
vectors (i.e., vectors of greater than unit length). To convert this 
Matrix into one for unit length vectors, it must be normalized 
(i.e., the sum of squares of each row and column of the matrix 


should be made equal to 1.00). 
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TABLE 7.5. ROTATION 4 


Matrix Aga Matrix Sas 
I I f Fd Y 


1 4955 .6656 .4024 .2211 3171 0 

t -4708 -.4275 -.4236 .5704 .301 o 

m | -.1103 -4806 .7909 .3555 -.0706 pue o o 1.00 o -30 

W | -4795 .3786 -.1819 .7067 -.3069 w 0 0 0 1.00 0 

E j 38390 ol 0 0 -8423 Y, [o] 0 s30 o 1.00 
Matrix Ly. = Ma Sas Matrix Ds 


I pig Ww € 

-7219 .4971 .3073 .2211 14378 
-3255 -.5876 -.5139 .5704 1740 
~.2737 —4431 .8121 .3555 1667 
-.3508 .5416 -.0898 .7067 -.3615 
-5390 .1833 -.2527 0 8423 


11154 1.1156 1.0900 1.0000 1.0899 


1.0561 1.0562 1.0440 1.0000 1.0440 
+9469 .9468 .9579 1,0000 -9579 


Si8lanaee 


-6835 .4707  .2944 
-3081 -.5563 -.4923 .5704 .1667 
7.2592 -.4195 .7779 .3555 1597| 1.0000 
~3322 .5128 -.0860 .7067 -.3463| 1.0001 
=5104 1735 -.2421 0 -8068| 1.0001 


o- E- 


3. Normalize matrix Ly»: 


a. Get the sum of squares for each column (3/*), As a partial 
check it can be observed that any two axes that were rotated to- 
gether (as I and II or III and IV) should have the same column 
sums. 

b. Obtain the square root of xP: VSP. 


c. Obtain the reciprocal of VIE: dp. 
d. The values in row 
D». 


də are used to write a diagonal matrix 


4. Multiply Lis by D, (which is the equivalent of multiplying 


each value in a column of Liz by its da value). This will give the 
normalized matrix Aj», which is the transformation matrix for 
the two plots considered thus far. A partial check can be ob- 
tained by calculating the sum of squares of each row of matrix 
Aue (3A). These values should be 1.0000 within rounding error. 
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TABLE 7.6. ROTATION 5 
Matrix Ais Matrix Seg 


m w 


4707 2944 2211 4194 
+3081 -.5563 -.4923 .5704 .1667 
7.2592 -4195 7779 .3555 1597 
-5128 0860 .7067 -.3463 
1735 -.2421 0 -8068 


4H 


Matrix Lig= Ays Soo 


+6238 .4707 .2944 4056 .4194 
1541 -5563 -4923 .6536 .1667 
T3552 4195 .7779 .2855 1597 
-5128 .0860 .6170 -,3463 
1735 2421 -.1378 .8068 
1.0731 10001 1.0002 1.0729 
1.0000 1.0001 1.0358 1.0000 
1.0000 .9999 .9654 


+6022 .4707 .2944 3916 .4194] 1.0001 
1488 —.5563 -.4923 .6310 .1667 .9999 
7.3429 =4195 .7778 .2756 1597] 1.0000 
75049 .5128 -.0860 .5957 -.3463] 1.0001 
=4927 4735 -.2421 -1330 .8068) 1.0001 


I 

I 
0 
w 
YX 


For the next rotation matrix we again select two plots which 
represent four different factors (in order to keep the factors orthog- 
onal). Graphical rotation 2 (Figure 7.6) contains axes II, and III. 

Graphical rotation 4 (Figure 7.8) contains axes I, and IV}. The 
Work for obtaining the matrix Ais which will represent these two 
8taphical rotations, and the two already accounted for by A1 is 
shown in Table 7.3 and was obtained as follows: 


I. Copy matrix Ais from the preceding table. 

2. Fill in the values in matrix S23 by obtaining the projec- 
tions of the rotated “long” axes in graphical plots 2 and 4 on 
the unrotated axes and entering them in the proper cells. 

3. Obtain matrix Lı; from the product of matrices Ais and Səs. 

4. Normalize matrix Lis, by multiplying it by matrix Ds. 

5. Obtain matrix Aj; from the product of matrices Ly3D;. 
Check the sum of squares of the rows to see that they are within 
rounding error of 1.0000. 


126 Introduction to Factor Analysis 


TABLE 7.7. ROTATION 6 


Matrix Aye Matrix Ser 
1 m g Y i, ç I m w F 
I 6022 .4707 .2944 .3916 .4194 14 | 1.00 o -20 o o 
i 1488 -.5563 -4923 .6310 .1667 i o 1.00 o o o 
m | -.3429 -.4195 .7778 .2756 .1597 Maj --20 o 1.00 0 o 
w | -5049 5128 -0860 .5957 -.3463 E o 0 0 1.00 0 
y | -.4927 1735 -.2421 -.1330 .8068 v 0 0 o o 1.00 
Matrix L= Aye Sez Matrix D7 


.4707 .4148 .3916 .4194 
-5563 -.4625 .6310 .1667 
-.4195 .7092 .2756 .1597 
.5128 -1870 .5957 -.3463 
.1735 -3406 -.1330 .8068 


1.0000 1.0399 1.0000 1.0000 
1.0000 1.0198 1.0000 1.0000 Matrix Av 
1.0000 .9806 1.0000 1.0000 1 I 0 w oy 


=.4782 .5128 -.1834 .5957 -.3463} 1.000! 
4356 


1 .5327 .4707 .4068 .3916 .4194| 1.0001 
m | .2425 -.5563 -.4535 .6310 .1667| .9999 
m | -4888 -.4195 .6954 .2756 1597] .9999 
x 
X. 


1735 -.3340 —.1330 .8068| 1.0000 


This routine is continued until all the plots have been used. 
In rotation 3 (Table 7.4) plots 5 and 7, containing axes Iz, Vo, Is, 
and IV;, are used. 


In rotation 4 (Table 7.5) plots 6 and 9, containing axes Is, IIs, 
III, and V; are used. 

In rotation 5 (Table 7.6) plot 10, containing I; and IVs is 
used. It was necessary to use only one plot in this rotation be- 
cause all of the other remaining plots contain either axis I oT 
axis IV, which are already represented and could not be put into 
the same S-matrix without disturbing the orthogonality of the 
reference axes. 

Rotation 6 (Table 7.7) involves I, and III, from plot 8. 

Rotation 7 (Table 7.8) involves III, and IV, from plot 11. 

Matrix ^is is the transformation matrix by means of which 
the rotated loadings can be obtained when the centroid matrix 
is post-multiplied by it. As a partial check on the accuracy of 
the work the transpose of the transformation matrix (A’1s) May 
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TABLE 7.8. ROTATION 7 


Matrix Ag Matrix S78 


3 I ug pg v 


.5327 .4707 .4068 .3916 .4194 
„2425 -.5563 -.4535 .6310 .1667 
~.4888 -4195 .6954 .2756 .1597 
-.4782 .5128 -.1834 .5957 ~.3463 
=.4356 1735 =3340 -.1330 .8068 


CE- E-E E 


Matrix Lyg= My S7 

Im & F 
.4707 .4303 .3672 .4194 
.2425 -.5563 -.4156 .6582 .1667 
~.4888 4195 .7119 .2339 .1597 
-.4782 .5128 -1477 .6067 ~.3463 
4356 1735 =3420 —.1130 .8068 


9999 1.0001 1.0035 1.0036 1.0000 


1.0000 1.0000 1.0018 1.0018 1.0000 
1.0000 .9982 .9982 1.0000 


4295 .3665 .4194| 1.0000 


.5327 .4707 


I (2425 -.5563 -.4149 .6570 1667 9999 
m | -.4888 -.4195 .7106 .2335 .1597 9999 
w | -4782 .5128 -.1474 .6056 -.3463 1.0000 


4356 1735 -.3414 ~.1128  .806' 1.0001 


be post-multiplied by the transformation matrix (Ais). For this 
or any transformation matrix for orthogonal factors the product 
should be within rounding error of an identity matrix (I). 

It will be observed from the above rotations that the order 
in which the plots are taken does not matter. Each plot should 
be used once and once only, and no factor should appear more 


than once in a single rotation. 


CALCULATING THE ROTATED VALUES 

As pointed out above, the rotated loadings may be 
obtained by post-multiplying the matrix of centroid loadings by 
the transformation matrix.” The rotated loadings may be checked 
by calculating the communalities and “reproducing” the correla- 
tion coefficients. They should agree with the corresponding values 


Computed from the centroid loadings. 
Saas 


8 For orthogonal rotation A'A = L 
F = FA, 2 
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FIGURE 7.9 


TABLE 7.9. ROTATION 8 


Matrix Aig 


m a Y 


I 4707 4295 3665 .4194 
I +2425 -.5563 -.4149 .6570 -1667 
m | ~4888 ~4195 .7106 +2335 .1597 
W | -.4782 5128 -.1474 -6056 -.3463 
Y 1735 -.3414 -.1128 -8068 
Matrix Lig= Ma Seo 
I a wy Y 


I | .4809 .5293 4295 3665 .a194 
I | .3037 -.5296 -4149 .6570 .1667 
I | -.4427 -.4733 -.7106 .2335 1597 
WY) -.5346 .4602 ~1474 .6056 -3463 
Y | =.4547_.1256 ~.3414 -.1128 8068 
Z1? | 1.0120 1.0122 .9998 1.0000" 1.0000 

VET? | 1.0060 1.0061 .9999 1.0000 1.0000 
dg | -9940 .9939 1.0001 1.0000 1.0000 


Matrix 


0 


Motrix 


Matrix 


A 


-4780 .5261 „4295 
I +3019 ~ 5264 -.4149 
M | -.4400-4704 .7106 
Æ | -5314 4574 -1474 
¥ | ~4520 .1248 -3414 -. 
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tor Analysis 


Seo 


Ig] 100 m o 
Ij- 100 ž ọ 

m 0 0 1.00 0 
nd te) 0 0 1.00 
Vv 


Dg 


3665 4194| 1.0000 
-6570 .1667 par 
2335 .1597| 1.000 
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The more accurate loadings obtained by matrix computation 
may be sufficiently different from those obtained by graphical 
rotation to make it advisable that they be examined for further 
needed rotations. In the illustration rotated factors I and II were 
plotted (see Figure 7.9) and a small clockwise rotation was made 
to reduce the loading of test 1 on factor I since its calculated load- 
img was considerably higher than its graphical loading. This 


TABLE 7.10. FACTOR LOADINGS AFTER EIGHT 
ALGEBRAIC ROTATIONS (F,Aj9) 


h? from 
r 4? | Centroid 
Loadings 


Test 


1 . Dial and Table Reading .. 
2. Spatial Orientation! . a 
3 . Reading Comprehension . 
4. Instrument Comprehension 
5. Mechanical Principles ... 
6. Speed of Identification. . 
7. Numerical Operations I . 
8. Numerical Operations I. 
9. Mechanical Information . 
10. Practical Judgment .... 
11. Complex Coordination . « 


Corr.Reproduced|Corr. Reproduced 
Tests from Rotated | from Centroid 
Loadings Loadings 

1,2 .40 -40 
2,3 18 18 
3,4 31 34 
4,5 .37 .37 
5,6 A7 al? 
6,7 15 15 
758 -67 -67 
8,9 -.05 =,05 
9,10 125) 
10,14 

1,1 


4 These reliability coefficients were taken from Guilford (1947b, Appendix B). 
te were obtained by the separately timed parts method and are corrected for 
Bth. 


si > The reliability coefficient for this form is not available. The reliability of a 
milar form is approximately .50. 
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rotation is added to the transformation matrix in Table 7.9. The 
rotated loadings and the checks on them are shown in Table 7.10. 


Ficure 7.10 Positions of axes after eight algebraic rotations. 


Plots of the final Totated Positions of all combinations of pairs of 
axes are shown in Figures 7.9 and 7.10. Several other minor 
adjustments of the axes are indicated in the plots in Figure 7.10 


(eg. m and IV slightly counterclockwise) but will not be made 
in this illustration. 
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PROBLEMS 


l. Rotate to simple structure and positive manifold the centroid 
factors obtained in Problem 5 of Chapter 5 by the graphical method. 
, 2. On the same set of axes plot both the rotated loadings obtained 
in Problem 1 above and the rotated loadings obtained in Problem 3 
of Chapter 5, in order to compare graphically the exact results ob- 
tained by the diagonal method with known communalities, and the 
approximate results obtained from the centroid method with esti- 
mated communalities. (Note: If the order of numbering the factors 
in the two solutions does not correspond, put the factors in the same 
order to facilitate direct comparison.) 

3. Rotate the centroid factors obtained in Problem 6 of Chapter 5 
to orthogonal simple structure. 

4. Rotate the factors obtained in Problem 1 of Chapter 6 to orthog- 
nal simple structure and positive manifold. 

5. Rotate the factors obtained in Problem 2 of Chapter 6 to or- 
thogonal simple structure and positive manifold. 


CHAPTER & 


Methods of Rotation: 
Oblique Axes 


= are several methods by which 
solution can be obtained. One fairly obvious w 
look at the orthogonal solution in the previous cl 
the hyperplanes oblique to one another, with the 
to the hyperplanes wherever the configur: 
I and V, II and II, III and V in Figure 
structure. The transformation 
exactly the same procedure as w 
with the addition th 


an oblique 
ay would be to 
napter and place 
axes orthogonal 
ation (as between axes 
7.10) indicates an oblique 
matrix would be obtained by 
as used for the orthogonal factors, 
at it is advisable to obtain a AA matrix after 
each rotation. This matrix, also referred to as the C 
the cosines of the angular separations of the oblique axes, These 
cosines should be taken into account in plotting oblique ax 
also for determining whether the axes are becoming so o 
that they no longer represent independent factors, 

is not necessary to obtain an orthogonal solution a 
to obtaining an oblique solution. 

can be rotated to oblique positions 


rMatrix, gives 


es and 
blique 
Of course, it 
S a first step 


The hyperplanes (and axes) 


as soon as the configuration 
132 
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indicates obliquity. Another useful method, which is used to 
locate one hyperplane and axis at a time, and usually results in 
oblique reference factors, is called the single-plane method (Thur- 
stone, 1947c, 216-224). 


SINGLE-PLANE METHOD APPLIED TO THE ELEVEN-VARIABLE 
EXAMPLE 

In this method the hyperplanes and corresponding 
axes are rotated one at a time. The method starts with the selec- 
tion of one of the variables. If a variable is known or is expected 
to be a pure, well-saturated measure of a factor, it is best to start 
with it; if not, any test whose communality is not very low may 
be selected. In the example, test 7 was selected as representing 
the start of a possible numerical factor. The steps for obtaining 
the rotated loadings in the V-matrix and the transformation 
matrix (A-matrix) are as follows: 


l. Select a test and write its centroid loadings and communal- 
ity in a column 4&jm (as was done for test 7 in the lower part of 
Table 8.1). 

2. Normalize the values in the column by computing Vh? and 
its reciprocal, and then multiplying each of the five loadings in 
column aj», by 1/ V. This gives the values in the next column, 
àma Which are the first trial values for the first column of the 
transformation matrix. 

3. Multiply each row of the centroid-loading matrix by the 
values in the column matrix Amg, to obtain the first trial rotated 
loadings. vjp = 3jmAmq- Thus, the first trial rotated loading for 
test 1 which is entered under trial 1 in the upper right-hand part 
of Table 8.1 was obtained as follows: 


.70 X 61 + —.29 X —.62 + —.08 X —.40 
+ —.11 X 18 + .15 X .20 = .65 


The rotated values for the other tests are computed and entered 


in the column for trial 1. 
4. Plot the first trial values against the loadings of each cen- 


troid axis, in turn, as shown in Figure 8.1. The plots are then 
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TABLE 8.1. SINGLE-PLANE METHOD OF ROTATION: CAL- 
CULATION OF THE ROTATED OBLIQUE FACTOR LOAD- 
INGS FOR THE FIRST COLUMN OF THE V-MATRIX 


Centroid Factor Loadings Trials 


Test 


1. Dialand Table Reading , 
2. Spatial OrientationI ...., 


3 Reading Comprehension ... 
4 Instrument Comprehension . 
5. Mechanical Principles .... 
6 Speed of Identification. 
7. Numerical Operations I.. 
8. Numerical Operations ... 
9. Mechanical Information .. 
10. Practical Judgment s.,s... 
11. Complex Coordination. ... 


EA mgòmp" 8890 
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examined to try to find clusters of points near the horizontal axis 
through which the hyperplane, which is perpendicular to the first 
trial vector, can be passed. (Rotations larger than 30 degrees 
are usually not advisable, and it is better to underestimate rather 
than overestimate the amount of rotation.) The rotated position 
of the plane has been drawn in on each plot of Figure 8.1. 

5. At +1.0 on each horizontal axis measure the vertical dis- 
tance between the horizontal axis and the rotated axis. This dis- 
tance is called s and is entered in the column so labeled in Table 
8.1 with proper sign. 

6. The method of obtaining the remaining values for trial 1 
in Table 8.1 should be fairly obvious: 


For row I, à — s = .61 — 30 = .31 
sà = .30 X .61 = .18 
1.00 — sa = 1.00 — .18 = .82 


7. The values in rows II through V are obtained similarly. 
The values in column / are for long vectors and must be normal- 
ized to yield the values in column A», which represent the values 
in the transformation matrix for the latest rotated position of the 
axis. The normalizing is done by obtaining $}, VSF, 1/V/3P. 
A partial check is that 2A? should equal unity. 

8. The cosine of the angle of the rotation can be determined 
by the vector product ZAmgùmp- For the first trial the value is .8890 
which is the cosine of an angle of about 27°. 

9. The second trial values are determined by the same method 
as the first; each row of the centroid-loading matrix is multiplied, 
in turn, by the column matrix Amp. The values are entered in 
the column for trial 2. The loadings for the second-trial values 
are plotted, in turn, against each centroid factor, as shown in 
Figure 8.2. Rotations seem to be called for in the plots of factors 
III and V. 

10. The values in column Amp for trial 1 are copied as the Amg 
values for trial 2 in the lower part of Table 8.1 and the steps for 


“(4 180) Suisn) poyyur ouvyd-apsurs Jo sanpea Jen) puoras 107 NOA g8 TUN 
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obtaining the values in the various columns of trial 1 are repeated 
in trial 2. It will be observed that Ang and l values are equal in 
rows (such as I, II, and IV) representing axes which have not 
been rotated. The values in the l-column are then normalized 
yielding the latest Amp values. The vector product SAmgùmp is equal 
to 0.9337 this time, representing the cosine of an angle of rotation 
of about 21 degrees. 

1l. The third trial loadings are obtained by multiplying each 
row of the centroid-loadings matrix by the latest column matrix 


Amp, Vip = ZAjmAmp- 

12. The process of obtaining trial loadings and plotting them 
against centroid-factor loadings is continued until no further rota- 
tions are indicated. It is recommended by Thurstone (1947c, p. 
222) that the process be continued until the final adjustment rep- 
resents a rotation of about 5 degrees (cosine of about 0.995 or 
more). The final column of trial values represents one column 
in the V-matrix of rotated oblique loadings, and the final Amp 
values represent one column of the transformation matrix (A).* 

13. To obtain the next column of the V-matrix and of the A- 
matrix, start by selecting a test that has a low loading on the first 
factor so as to avoid obtaining the same factor again. (In the 
example, test 2 with a loading of 0.05 on factor I was selected.) 
The procedure that was used to obtain the loadings on rotated 
factor I is repeated to obtain the second column of the V-matrix 
of oblique factor loadings and the calculations are illustrated in 
Table 8.2 and the plots in Figures 8.3 and 8.4. 

14. The scalar product of any two columns of the transforma- 


tion matrix will, of course, give the cosine of the angular separa- 

1 If the loadings on the primary axes (see p. 192) rather than on the simple axes 
are desired, the transformation matrix (T’) which will transform the centroid load- 
ings into primary-axis loadings can be obtained from the A-matrix by the follow- 


ing steps: 
(1) Write matrix 
(2) Find the inverse © l 
of Chapter 3, also illustrated i Se f 
(3) Normalize matrix A’—1 by obtaining the sum of the squared values in 
each row (312), the square root (V212), and reciprocal (1/ YZI?) and multiplying 
the values in each row of the matrix by the reciprocal value (d;) for row j. 
atrix is T’, the transformation to the primary axes and 


(4) The resulting m: s 3 i 
TT’ gives the cosines of the angular separation of the primary axes. 


A’, the transpose of matrix A. 
f matrix A’ (i.e, A’—1) by the method on pages 26 to 31 


n the next section. 
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TABLE 8.2. SINGLE-PLANE METHOD OF ROTATION: CAL- 
CULATION OF THE ROTATED OBLIQUE FACTOR LOAD- 
INGS FOR THE SECOND COLUMN OF THE vV-MATRIX 


wUbwn = 


25 oono 


EA mgòmp *-8821 


Trial 1 


El? =1.0515 


Ui? = 1 0254 
ÁZ = 9752 


Z? = 1.1306 
vi? = 1.0633 


T? = 9405 


*(g 3801 Suysn) pow ouxjd-a[Surs Jo sanyea per puosas 107 sIOJd 
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tion of the two factors. (In the example, SAmpiAmp2 = 0.27 which 
represents a separation of about 74 degrees between the first two 
simple axes of the oblique solution.) The remaining columns of 
the transformation matrix are obtained by the same procedure. 

15. The work can be checked by the formula F,F’, = 
V(A'A)—'V’, where F, represents the centroid-factor matrix and V 
represents the matrix of rotated, oblique (structure) loadings and 
A the transformation matrix obtained by the method outlined 
in this section. 


Direct ROTATION TO PRIMARY OR SIMPLE STRUCTURE 


The method of oblique rotation to be illustrated 
next was presented by Harris (1948c). At present it is perhaps of 
more theoretical interest than of practical value since it is a rela- 
tively new method and all of the problems of applying it to rota- 
tional solutions of large batteries have not been worked out. It 
has, however, considerable promise for the future not only be- 
cause it requires only one rotation but also because Harris (1949) 
has set up some models, based on projective geometry, which may 
result in new, improved criteria for evaluating rotational solu- 
tions. ‘The method as outlined by Harris lends itself most di- 
rectly to obtaining the loadings on the primary axes (at the inter- 
sections of the planes or hyperplanes) which are usually a useful 
set of loadings. If, however, the loadings on the simple axes 
(normals to the planes or hyperplanes), that is, the loadings that 
are appropriate to simple structure and that are more comparable 
to the loadings obtained by other oblique-solution methods, are 
the desired ones, they can be obtained by some additional com- 
putation as will be explained below. 

The first step in applying the method is to obtain the extended 
values of the centroid loadings (Thurstone, 1947c, Chap. XI). 
Extending the values of the loadings means extending the vector 
of each variable so that it has a projection or loading of +1.00 on 
the first centroid (or any other central) axis. Table 8.3 gives the 
extended values of the centroid loadings in Table 8.1. They are 


obtained by getting the reciprocal of the first centroid loading of 


2 See page 192 for an explanation of primary axes and simple axes. 
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each test (1/aj:) and multiplying all of the centroid loadings in 


row j by the reciprocal. 


Trace of 
Plone C 


Trace of ” Trace of 
Plane A Plane 8 


(Orthogonal 


to 
Axes IandI) 


Ficure 8.5 Representation of extended vectors for a three-dimensional problem. 


This method is most appropriate for problems in which there 
e loadings on the first centroid factor. Extend- 


are all high, positiv 
mensional problem so that they have 


ing the vectors in a three-di 
a projection of +1.00 on the first cen- 
troid axis essentially gives the projec- 
tion of the configuration on a plane fk 
perpendicular to the first axis. The Plane C 
plane intersects axis I at +1.00 as is 
shown in Figure 8.5. This has the 
effect of showing the three-dimen- 
sional configuration in two dimen- 
sions. If the vectors lie in three or- 
thogonal planes, the projection should 
be an equilateral triangle (see Figure mor HG Plot of Ginna 
8.6). If they lie in oblique planes, vectors for che problem in Fig- 
the projection may be a triangle with =s , 

unequal sides. The corners of the triangles represent the intersec- 
tions of the planes, which is where the primary axes are placed. 
In other words, if axes are drawn from the origin through the 


Trace of 
Plane A 
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corners of the triangle, they represent the primary axes and a 
transformation matrix for these axes can be written immediately 
by observing the projections of the points of intersection. If there 
are tests located at these points their extended or centroid 
loadings can be used to write the (long-vector) transformation 
matrix. 


Eleven-variable Example. The extended loadings of factors 
II through V in Table 8.3 are plotted two factors at a time as 


TABLE 8.3. EXTENDED ¢FACTOR LOADINGS OBTAINED 
FROM THE CENTROID LOADINGS OF THE ELEVEN.VARI- 
ABLE EXAMPLE IN TABLE 5.18 


Factor 


2.0000} 1.00 
1.7241] 1.00 
1.7544] 1.00 
2.0000] 1.00 
2.1277) 1.00 


2.0000] 1.00 
1.7544] 1.00 


3.3333 | 1.00 


2.0000] 1.00 


shown in Figure 8.7. The configurations to be expected from 
five-dimensions are not so obvious, although Harris (1949) has 
outlined the principles by which they can be determined. We 
will restrict ourselves, here, to trying to find the corners at the 
intersections of the planes, of the configurations. In Figure 8.74 
variables 2 and 6 seem to be at one corner, variables 7 and 8 at 


another corner and variable 9 at a third corner, with variables 
3 and 10 possibly indicating a fourth corner. 


Let us inspect the 
other plots to see if these corners show up. 
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In Figure 8.7) variables 7 and 8, 3 and 10, 9, and 2 and 6 seem 
to show up again as corners. 

In Figure 8.7c the corners seem to be at variables 7, 6, 9, and 11. 

In Figure 8.7d the corners seem to be at variables 7 and 8, 3 
and 10, 2 and 6, and 9. 

In Figure 8.7e the corners seem to be at variables 7, 10, 6, and 
11; and in Figure 8.7f at variables 11, 10, and 9. 


a 
-1.0] 


<a Se ese Se a SE a % 12 18 
(d) (e) (r) 


Ficure 8.7 Plots of the extended factor loadings in Table 8.3. 


If we select one variable to represent each corner of the five 
primary axes, the most consistent corners and also the most mean- 
ingful in terms of the nature of the tests represented are variables 
6, 7, 9, 10, and 11. Inspection of the graphs indicates that vari- 
ables 5 and 9 seem to appear in the same vicinity (as would be 
expected since they are both mechanical tests), but the high pro- 
jection of 9 on extended factor II tends to distort the figures if 
it is used as a corner. It was, therefore, decided to establish a 
point about halfway between the two variables, labeled point 
(5, 9), to represent the mechanical factor. It should be observed 
that once a point is placed in Figure 8.7a, its projections on axes 
II and III are established and when, for instance, the point is to 
be placed in Figure 8.7b, its projection on axis II is already set 
and its projection on axis IV only remains to be determined in 


that plot. 
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The extended-factor loadings of the variables or points used to 
determine the corners of the configuration (i.e., the primary axes 
at the intersections of the planes) are listed in the columns of a 
matrix labeled P’ in Table 8.4. Normalizing this matrix by the 


TABLE 8.4. STEPS IN THE CALCULATION OF THE TRANS- 


FORMATION MATRIX FOR THE OBLIQUE PRIMARY 
FACTORS 


Direction Numbers 
for the Primary 
Factors 


1.9745 


1.4052 1.6335 1.8243 1.3184 1.2028 
«7116 


Transformation Matrix 
(Direction Cosines) 
for the Primary 
Factors 


Cosines of Angular 
Separations between 
Primary Axes 


steps shown in the table yields a transformation matrix T’. If 
the loadings (structure values) on the primary axes, Rjp, are desired 
they can be obtained by multiplying the matrix of the centroid 
loadings by T’ (i.e., F.T’). Also the correlations among the pri- 
mary axes can be obtained from the matrix product TT’. 


TABLE 8.5. CALCULATION OF MATRIX T-1 (THE 


INVERSE OF MATRIX T) 


T y 


Enter the volues from motrix T in rows 1.7, 
the identity matrix in columns f through j, 


Row 2, 
Row 8. 
Row 14, 
Row 20. 
Row 26. 


Row 9. 
Row 3. 
Row 15. 
Row 21, 
Row 27, 


Row 16, 
Row 4, 
Row 10. 
Row 22. 
Row 28. 


Row 23, 
Row 5. 
Row 11, 
Row 17. 
Row 29 


Row 30. 
Row 6. 
Row 12. 
Row 18. 
Row 24. 


—1,4282 
“1.2116 


—.5370, 
~.2800| 


—4328 


13,19, 0nd 25, columns othrough o , ond enter 
os shown above. 


Divide 10 through 1f by 10. 
Boe 7070+ 20 20.0; Bön 7b- 70.2b;8¢ = e- Tor2e;. . - . OF ® Tf UA 
14or13a-130s 20=0.0:1465135-130° 2b)... 14513-130% PP. 
200819 0-19e+ 2220.0;2082196-190" Zb;-... 20219-190% BP- 
2602250-250+ 2020.0;26b256-250° 26;....26 9257-250 ar. 


Divide Bb through 8g by 86. 
Jon 26-26-96 = 0.0; 3ee2e-2b+9er3d" 2d- 2b-9d; «... 39°29 = 26499. 


1565146-146¢ 9b=0.0;1 5014c -14b> 9c; 15g "14g - be 99. 
21b520b-20b* 9620.0;21¢ =20¢-20b+ 9c j2td#20d-206+ 9 
276°266~-266* 96=0.0;27¢ #26¢-266* 9e 279 26g -260° 9g- 


21g = 209-200: 97. 


Divide 15¢ through 15A*by 15¢ + 
4c23¢ - 3¢116¢#0.0;4de30 -3 
10es 9c- 9er16e20.0;10d 9d- 9eeI6di 
22es21e-21e "16c 0.0;22d721d -21c +16 d: 
2Bce27e-27¢ +16¢70.0;2892279-276*16d; 


e+ 16d; 4e=3e-3e* 160; +... 4A33A-3c+16A. 
10h® 9h- 9¢ 16A. 
22h 22th -210 16h. 


284 827h = 27¢ 16A. 


Divide 22d through 22i by 22d. 
Soe 4d -4d923020.0; Sesdo- 4d*23¢5f* 4f-49+23f;, Siear =4g23i. 


11d210g-100+234=0.0;1 te=100-10d923 17 210/ = 109+23i. 
17d=16d- 16d +23¢= 0.0; Te 16e-16d+23¢ 177 2167 - 16d +237. 
294228d-28d «23d = 0.0; 2902280 ~28d+230% 29i 228i -289 +23. 


Divide 29e through 29/ by 290. 
Ge= 5e-5e+300=0.0; 6f- 5f- 50°30. 6g=5g-5¢ + 30g; ...- 6j” 5j-50*30j. 
12e 11e -110-300 0.0; 1272117 - 11e *30f; tayeirs= e307. 
18e217e-17e+30e=0.0;18f17f -170 307; 1842174 - 1704307. 
240e230-230+3000.0:24f 2237-230 +307; 2áje23j -23¢*30/. 


arene 
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If, however, the loadings (structure values) on the oblique 
simple axes, called a V-matrix, are desired, the following steps 
should be accomplished: 


1. Obtain and inspect matrix (IT’) to determine that the 
primary factors are not correlated to a greater extent than you 
are willing to accept in the rotational solution. (In the example 
the highest correlation is between P, and P;. The value .7057 
represents the cosine of an angle of almost 45° between the axes.) 

2. Get matrix T-1, which is the inverse of matrix T (the latter 
being the transpose of matrix T’). This may be done by the 
TABLE 8.6. STEPS IN THE CALCULATION OF THE TRANS- 


FORMATION MATRIX FOR THE OBLIQUE SIMPLE 
FACTORS 


(Inverse of the 
Transpose of 


Matrix T’) 


(Transformation Matrix 
forthe Simple 


Factors ) 


(Cosines of Angular 
Separations between 


Simple Axes ) 
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procedure for obtaining the inverse matrix described in Chapter 
3 (pages 28 to 30). A special work sheet and the procedure for 
obtaining each value for this 5 X 5 example are shown in Table 
8.5. The accuracy of the work can be checked by multiplication 
of TT-! = I (within rounding error). 

3. Normalize matrix T~1 by the steps shown in Table 8.6. 
This normalized matrix A = T~*D is the transformation matrix 
which will rotate the centroid loadings to the projections on the 
oblique simple axes. In other words, V = F.A. (See Table 8.7.) 
These rotated factors will be interpreted in the next chapter. 


TABLE 8.7. ROTATED FACTOR LOADINGS ON THE 
OBLIQUE SIMPLE AXES (V) 


Factor 


. Dial and Table Reading.... 
2. Spatial OrientationI..+.++- 
3. Reading Comprehension. «« 
4. Instrument Comprehension . 


5, Mechanical Principles....-|--03 | .02 55 | .03 | .09 


6. Speed of Identification . .. 


7. Numerical OperationsI... 


8. Numerical Operations... 


9. Mechanical Information .+ 


10. Practical Judgment. ss» 


11. Complex Coordination «+++ 


4. The correlations among the simple axes are obtained from 
the multiplication A’A = C (cosines of the angular separations of 
the axes). (See Table 8.6.) In general, the signs of the correla- 
tions between simple axes will be the opposite of the signs be- 
tween the corresponding primary axes. ; 

5. The loadings may be checked by reproducing selected, 
or all of the correlations and communalities by the formula 


R = VAA) V. 
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PROBLEMS 


1. In the illustration for the single-plane method of rotation the 
first two columns of the V-matrix and A-matrix for the eleven-variable 
example were obtained. Obtain the remaining three columns of each 
of these matrices. Also obtain the A’A or C-matrix. 

2. Apply the direct-rotation-to-primary-structure method of rota- 
tion to the centroid factors obtained in Problem 6 of Chapter 5, ob- 
taining the loadings on the primary axes (matrix R,,), the transforma- 
tion matrix for the primary axes (matrix T’) and the cosines of the 
angular separations of the primary axes (matrix ET"); 

3. Convert the T’-matrix found in Problem 2 above to the cor- 


responding ,q-matrix and compute the loadings on the simple axes 
(V-matrix). 


CHAPTER 9 


Interpretation of Factors 


_ AN rrer the rotated factor loadings have been 
obtained, an interesting next step is to try to identify the content 


and nature of the factors. This is done by inferring what the 
tests with high loadings on a factor have in common that is present 
to a lesser degree in tests with moderate loadings and absent from 
tests with zero or near-zero loadings. As with all scientific hy- 
potheses, these inferences need to be verified by prediction and 


further investigation. (See also pp- 196-197.) 


ELEVEN-vARIABLE EXAMPLE 


In a new area of investigation it would ordinarily 


not be feasible to identify five factors derived from only eleven 
tests. In this case, however, since the correlations are based on a 
large sample and the factorial content of these tests has been 
established in other factorial studies (see Guilford, 1947b, Chap. 
28) the results can be used to illustrate the procedure for identify- 
ing factors. The rotated orthogonal factor loadings in Tables 
7.1 and 7.10, and the oblique factor loadings in Table 8.7 will be 
used for this purpose since they all lead to the same interpreta- 


tions. 
Comprehension and Test 10, 


Only two tests (Test 3, Reading 

Practical Judgment) have high loadings on factor I of the orthog- 
onal solutions and factor IV of the oblique solution. The former 
149 
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test reflects individual differences in verbal comprehension. The 
second test presents verbal descriptions of judgment situations 
and then several alternative solutions are suggested. It is be- 
lieved that verbal comprehension is represented in the scores on 
this test also to a considerable extent. None of the tests with low 
loadings on this factor seems to require verbal comprehension, so 
the factor has been identified as a verbal-comprehension factor. 

The two tests with the highest loadings on factor II are tests 
of speed and accuracy of arithmetic computation (Test 7, Numer- 
ical Operations I and Test 8, Numerical Operations II). Test 1, 
Dial and Table Reading, which has a sizable loading on this factor, 
also involves the use and comparison of numbers in reading 
various dials and tables. Apparently this is a numerical-facility 
factor. None of the tests with low loadings seem to require 
numerical facility. 

Tests 2 and 6 have high loadings on factor III of the orthogonal 
solutions and factor I of the oblique solution. Test 2, Spatial 
Orientation I, requires the matching of a part of an aerial photo- 
graph with the whole picture. The perception of small detail is 
involved. Test 6, Speed of Identification, also involves the match- 
ing of small perceptual detail. Since both of these tests are 
speeded, the factor is identified as a perceptual-speed factor. Test 
1, Dial and Table Reading, has a moderate loading on this factor. 
Presumably, working with tables of numbers and dials requires 
some perceptual speed. None of the other tests seem to require 
appreciable amounts of perceptual speed. 

‘The two mechanical tests, Test 5, Mechanical Principles, and 
Test 9, Mechanical Information, are highest on factor IV of the 
orthogonal solutions and factor III of the oblique solution. The 
Mechanical Information test requires considerable mechanical 
knowledge and experience. The Mechanical Principles test de- 
pends somewhat less on mechanical background although that 


_ iit might be noted that, with only two tests high on it, the factor would be 
difficult to identify. Other things these two tests might have in common are judg- 
ment, reasoning, etc. The results of this one study would be insufficient to estab- 
lish this as a verbal factor. Other studies have shown that if a vocabulary test 
of the appropriate level of difficulty were added to this type of battery it would 


have a high loading on this factor and on this factor only, thus further establish- 
Ing it as a verbal factor. 
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type of background is helpful in obtaining the correct answers. 
Apparently these two tests have mechanical background as a com- 
mon source of variance, and the factor is identified as a mechan- 
ical-experience factor. Tests 4 and 10 also have moderate load- 
ings on this factor in the orthogonal solutions. Some of the 
items in Test 10 involve judgment in mechanical situations, which 
probably accounts, in part, for its moderate mechanical-experience 
loading. It may not be quite so obvious why Test 4, Instrument 
Comprehension, involves mechanical experience unless, somehow, 
the reading and interpreting of airplane instruments is related to it. 

Also, as was previously noted, mechanical and numerical tests 
usually have low negative correlations. Hence, the negative load- 
ing of Test 7 on the mechanical factor and the negative correla- 
tion of the numerical and mechanical primary factors. (See Table 
8.4.) 

Factor V has three tests (Test 11, Complex Coordination; Test 
4, Instrument Comprehension; and Test 1, Dial and Table Read- 
ing) moderately loaded on it. The nature of this factor was not 
obvious when it was first isolated. It is of special interest because 
it is common to both paper-and-pencil and performance tests. 
Much research has gone into identifying the common behavior 
tapped by these and similar tests. It seems to involve compre- 
hending spatial orientation with reference to the position of the 
observer and has been named a spatial-relations or spatial-orienta- 
tion factor. In the Instrument Comprehension test, for instance, 
the direction and attitude in which a plane would be flying has 
to be determined from the dials as they would be read in the 
cockpit. 

As can be seen from matrix C in Table 8.6, some of the factors 
are correlated and depart considerably from orthogonality in the 
oblique solution. The highest correlations are between the per- 
ceptual and spatial, verbal and mechanical, and mechanical and 
spatial factors. Inspection of the plots for the orthogonal solution 
(Figure 7.10) indicates that some of these factors, as represented 


2 Loadings of .2 or less are usually regarded as insignificant, loadings of .2 to 3 
as low, .3 to .5 as moderate, 5 to .7 as high, and above .7 as very high. This is, 
of course, an arbitrary classification and varies somewhat depending upon the N, 


the range of talent, the kind of correlation used, etc. J 
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by the tests in the battery, are correlated in this population. Ad- 
ditional examples of the interpretation of factors will be given 
in Chapter 10. 

After the factors have been identified with some certainty, it 
might be of interest to look at the content of each test individually. 


Test 1, Dial and Table Reading, seems to be a rather complex 
test, having appreciable loadings on the numerical-facility, spatial- 
relations and perceptual-speed factors. Its communality indi- 
cates that .62 of its variance is common-factor variance. This is 
somewhat short of its reliability coefficient of 0.87, the difference 
indicating specific variance, unaccounted for in this battery of 
tests. 

Test 2, Spatial Orientation I, has an appreciable loading on 
the perceptual-speed factor only. It seems to be a “pure” measure 
of this factor, although the discrepancy between its communality 
(.49) and reliability (.63) indicates the possibility of some poten- 
tial common-factor variance not brought out in this analysis. 

Test 3, Reading Comprehension, seems to be a relatively pure 
measure of the verbal-comprehension factor, although its low, 
possibly significant loadings on the numerical-facility and mechan- 
ical-experience factors may reflect mechanical and numerical con- 
tent in some of the items. 

Test 4, Instrument Comprehension, has loadings on the spatial- 
relations and mechanical-experience factors. Its communality 
(.40) is far short of its reliability (.84), indicating that considerable 
reliable variance has not been accounted for. 

Test 5, Mechanical Principles, has its high loading on the 
mechanical-experience factor. Its communality (.55) is also short 
of its reliability (.82). 


Test 6, Speed of Identification, seems to be a rather “pure” 


measure of the perceptual-speed factor, although the discrepancy 


between its communality (.44) and reliability (.76) leaves room 
for significant loadings on factors not isolated in this battery. 
Tests 7 and 8, Numerical Operations I and II, are pure meas- 
ures of the numerical-facility factor. Since their communalities 
approximate their reliabilities, most of their reliable variance is 
accounted for and these tests should remain pure in other anal- 
yses. The communality of Test 7 (.67) slightly exceeds its esti- 
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mated reliability (.64). This would not be expected from the- 
oretical considerations, and probably indicates that the reliability 
of the test has been underestimated. 

Test 9, Mechanical Information, has its only high loading on 
the mechanical-experience factor. Its communality (42) is also 
far short of its reliability (.85), less than half of its reliable variance 
having been accounted for by its common-factor loadings in this 
battery. 

Test 10, Practical Judgment, has its major loading on the 
verbal-comprehension factor and a secondary loading on the me- 
chanical-experience factor. 

Test 11, Complex Coordination, has a loading of .46 on the 
spatial-relations factor and low but possibly significant loadings 
on the perceptual-speed and mechanical-experience factors. Its 
communality (.36) is far short of its reliability (.91). If this test 
Were put into batteries with other tests, some of the reliable vari- 
ance which is specific in this battery might become identifiable 


common-factor variance. 


REPORTING A FACTORIAL STUDY 

As is true of all scientific reporting, it is desirable 
to give a complete account of a factorial study so that the results 
can be verified and the computations checked. The following are 
some of the items that should be included in a complete account 


of a factorial study: 


The Measures. The tests or other variables in the battery 
should be described as completely as possible. If the tests are not 
generally available, they should be placed on microfilm or made 
available in some other way. Scoring formulas, reliabilities, time- 
limits, and special conditions of administration should be stated. 
Distributions, means, standard deviations, and measures of skew- 


ness and kurtosis should be supplied. 


The Sample. The population from which the sample is drawn 
should be described as completely as possible in terms of back- 
h as age, sex, education, geographic location, 


ground variables suc : : 
The method of sampling and size of 


socio-economic level, etc. 
the sample should be stated. 
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The Analysis. The method of correlation should be speci- 
fied. The correlation matrix and matrix of extracted factor 
loadings should be given. The data for applying the criteria 
for sufficient factors and the table of residuals following the last 
factor extracted should be shown. Where the conservation of 
space is important, as in journals, it has been suggested that only 
one-half of the symmetrical correlation matrix need be printed 
and the other half of the table could be devoted to the final 
residual-correlation coefficients. 

The transformation matrix and matrix of rotated factor load- 
ings should be given. For oblique solutions the cosines of the 
angular separations among the axes (C-matrix), or correlations of 
the primaries (TT’) should be shown. For some purposes the 
loadings of the tests on the primary axes (Rip = FT’) should be 
furnished. 

For the convenience of the reader, a complete set of diagrams 
of the rotated axes should be supplied, and a discussion of the 
interpretation of the factors should be presented. 

Not all of the items mentioned will be needed in every report, 
and several items not mentioned may be needed in others. Where 
lack of space in a journal or other publication medium makes it 
impractical to report all of the data, the portion not published 
can be put on microfilm by the American Documentation Insti- 


tute, Washington, D. C., and thereby made available to those who 
desire to examine them. 


PROBLEMS 


1. Interpret the rotated factor loadings obtained in Problem 3 of 
Chapter 7. 


2. Interpret the rotated factor loadings obtained in Problem 1 of 
Chapter 8. 

3. Interpret the rotated loadings obtained in Problem 2 of Chapter 
8 and compare the interpretations with those made in Problem 1 
above. 

4. Interpret the loadings obtained in Problem 3 of Chapter 8 and 


al the interpretations with those made in Problems 1 and 3 
above, 


CHAPTER 10 


Applications in the 
Literature 


A GLANCE at the bibliography at the back of 
this book will indicate the extensiveness of the literature on factor 


analysis in recent years. The purpose of this chapter is not to 
review this voluminous literature, but to summarize a few studies 
which illustrate the application of some of the methods described 
in the preceding and succeeding chapters.* The studies sum- 
marized below have been selected because they illustrate a tech- 
nique, a point or principle in the theory of factor analysis, an 
application of the method to the solution of a theoretical prob- 
lem in science, or a combination of these. Dozens of other studies 
could have been selected which would have served these purposes 
equally well. Most of these investigations are much more exten- 
sive than the presentation here would indicate. Only the portions 
which illustrate the above-mentioned points have been abstracted, 
and the original references should be consulted where there is 


interest in the subject matter or in the theoretical implications. 


ys of the literature can be found in Vernon (1951), French 


1Some recent reviev 5 
ix (1950), and Carroll and Schweiker (1951). 


(1951 and 1953a), Royce (1950b), F: 
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FACTOR ANALYSIS OF CONDITIONED RESPONSE INDICES 


Humphreys (1943) set out to determine whether 
the concept of excitatory potential established in connection with 
the continued reinforcement of a conditioned response is unitary 
or multiple. The potential is not directly measurable but is une 
ferred from such measurable characteristics as frequency, ampli- 
tude, and latency of response. These characteristics can be meas- 
ured in both the acquisition and extinction periods of the condi- 
tioning. In addition to the measures listed above, two measures 
of sensitivity of the reflex on which the conditioning was based, 
were obtained. The unconditioned response used in this study 
was the human eyelid reflex to a puff of air directed 
cornea. The conditioned stimulus w 
amount of illumination. 
were obtained by several 
of 200 subjects in all. 


against the 
as an abrupt increase in the 
The light measures mentioned above 
investigators on five groups consisting 
The correlations obtained from the sepa- 
rate groups were averaged to obtain more reliable estimates of 
the various relationships. There were no consistent differences 


for any one group, and so it was considered to be justifiable to 
combine the correlations for the 


The intercorrelations for ei 
centroid method to three fact 
mined to be significant and r 
loadings, rotated loadings, a 


several groups. 

ght measures were factored by the 
ors, only two of which were deter- 
etained for rotation. The centroid 
nd communalities are presented in 
TABLE 10.1. ORTHOGONAL FACTOR LOADINGS OF STUDY 
OF EXCITATORY POTENTIAL OF 


HUMAN EYELID CON- 
DITIONED RESPONSE 1 


(Signs of Latency Variables Reflected) 


Variable T 

1 [Reflex to Puff.....,., 

2 | Reflex to Light sassis ë 

3 |Acquisition-Frequency , „ a 
4 | Acquisition - Amplitude, 62 
5 Acquisition -Latency ,,, -67 
6 Extinction- Frequency .. 45 
7 Extinction -Amplitude, , -43 
8 Extinction - Latency, ,., -41 


1 From Humphreys (1943, P- 107). 
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Table 10.1. A plot of the rotated factor loadings is shown in 
Figure 10.1. 

It was concluded that the concept of excitatory potential as 
applied to human eyelid conditioning is not unitary as had been 
previously assumed but is accounted for by two independent fac- 
tors. Other conclusions were that latency and amplitude are not 
measures of the same thing and that frequency in acquisition is 
somewhat different from frequency in 
extinction. 

On the basis of previous nonfactorial 
work in the area, factor I was identified 
as attitudinal, i.e., expectancy of stimulat- 
ing conditions. Factor II was believed to 
be physiological in origin and is defined by 
the two measures of sensitivity of the re- 
flexes and the amplitude of the condi- joa, 10.1 plot of ro- 
tioned responses. Variable 3, acquisition- tated factor eee 
frequency, is loaded on both factors and is ee rere Tei p. 108) 
influenced by both attitudinal and physio- 
logical factors. It contains the most “information” and might be 
useful if a single measure combining both sources of variance 
were desired rather than a pure measure of each of the two in- 
dependent factors. 


Is THERE A GENERAL-INTELLECTIVE FACTOR IN PERFORMANCE 
TEsts? - 

A group of thirty-four of the most commonly used 
performance tests was administered by Morris (1939) to determine 
whether they could be considered to be measures of a general- 
intelligence factor or, if not, how many and what types of factors 
were being measured. The battery was administered to fifty-six 
nine-year-old boys in a New York City grade school who were 
homogenous with respect to school grade, socio-economic level, 
and racial background and were above average in intelligence 
and in mechanical ability. 

Seventy-five scores were obtained from the tests, but only the 
thirty-three listed in Table 10.2 were selected for factor analysis. 
Those scores which had the largest standard deviations and were 
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the most objectively determined were considered most sititable 
for analysis. In order to minimize the number of negative cor- 
relations, scores based on time, errors, moves, and the like were 
reflected and are indicated by a negative sign preceding the vari- 
able number in Table 10.2. Even after reflection a number of 
the correlations remained negative, whereas a large number of 
the correlation coefficients, both positive and negative, did not 
differ significantly from zero, an indication that a single general 
factor could not be postulated for the correlation matrix. 


TABLE 10.2. ROTATED ORTHOGONAL FACTOR LOAD- 
INGS FOR AN ANALYSIS OF PERFORMANCE TESTS + 


Test Factor 
e 


— 1. More and Foal--Time ......, e.. 
— 2. Seguin Form Board--Time of Shortest Trial 
=~ 3. Five - Figure Form Boord -= Time . 
— 4. Two- Figure Form Board -- Time 
= 5. Casuist Form Board -- Time 
— 6. Triangle Test-- Time 
— 7. Diagonal Test-- Time . . , 
— 8. Healy Puzzle A-- Time, 
— 9. Manikin Test-- Time . , 
—10. Feature Profile-— Time, 
—11, Ship Test-- Time , myt 
12. Picture Completion Score 
—13. Substitution Test-- Time 
14, Adaptation Board Score . 
15. Cube Test -- Number Correct 5 
-16. Maxfield Color Cubes Design C--Time XPE 
—17, Witmer Form Board -- Shortest Trial Time awai 
~18. Lincoln Hollow Squure ~~ Total Time , aie wie ete 
—19. Witmer Cylinder -— Shortest Trial Time, 
~20. Dearborn Form Board --Total Moves, 
21, Porteus Maze ~~ Years Credit { MOBY sot 6 Rt bw ve 
22. Speed of Reaction--Dominant Hand , Number 
=23. Aiming -Dominant Hand, Total EPOP; a « 
. Eye-Hand Coordination Maze -- Time 
—25. Form Cancellation --Time , 
=26. Dot Counting -- Total Time . 


Se Sein 


~29, Square Block Form Board -- Dominant Ha 


Dominant Hand EE 
~32. Visual Acuity... a ore aa ye 


33. Revised Minnesota Paper Form Board -Total Score : 


1 From Morris (1939, p. 96). 
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The rotated loadings and communalities on three orthogonal 
factors are shown in Table 10.2. A fourth centroid factor was 
extracted, but it appeared to be not significant since only one 
loading contributed more than .10 to the test communalities. 

Factor I, although identified with the least confidence of any 
of the three factors, is believed to represent induction. High 
scores on tests loaded on this factor require the discovery of a pat- 
tern or rule of procedure. Thus, for the Dot Counting test (No. 
26, time score, and No. 27, error score, incidentally the only two 
experimentally dependent scores in the analysis), superior per- 
formance depends on the subject learning to form clusters to count 
the dots. Likewise in the Picture Completion and Aiming tests, 
patterns or rules must be discovered in order to achieve superior 
performance. 

Factor II is identified as a perceptual-speed factor. All of the 
scores loaded on the factor represent time or speed scores of one 
sort or another, and the factor is interpreted as representing the 
“ability to make a quick identification of simple perceptual mate- 
rial when mixed with like or similar material.” 

Factor III has a number of form boards and also the Re- 
vised Minnesota Paper Form Board (Test 33) loaded on it and 
is identified as a visualizing factor. “These tests all seem to in- 
volve an ability that requires the perception of form and space 
relations.” 

The communalities of some of the tests (e.g., Tests 6, 7, 8, and 
32) are remarkably low considering that they are correlated with 
other performance tests, indicating either low reliability or con- 
siderable specificity. The three factors were found to be orthog- 
onal to one another, and the hypothesis that the common variance 
of these tests could be accounted for by a single general factor 
which could be identified as general intelligence was rejected. 


LEARNING DyNAMICS IN BRIGHT AND DULL Rats 
Wherry (1941) has made an interesting application 


of factor analysis to the dynamics of rat maze learning using some 


data compiled by Tryon for 19 trials on a 17-unit multiple-T 
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maze.’ The subjects were 530 rats bred for “brightness” and 550 
rats bred for “dullness.” The 19 trials were divided into ten 
periods, and the percentage of total entrances into each alley for 
a given period was considered as the score for that alley. Thus, 
the correlation matrix consisted of the intercorrelations of the 
10 periods over a sample of 17 alleys. Two Pearson product- 
moment correlation matrices were obtained, one 
rats and one for the dull rats, and each one was f. 
centroid method. Three factors were obtained fr 
and rotated. Table 10.3 gives the rotated lo 


for the bright 
actored by the 
om each matrix 
adings and the iden- 


TABLE 10.3. ROTATED ORTHOGONAL FACTOR LOAD- 
INGS FOR BRIGHT AND DULL RATS1 


Bright Dull 


Factor 


Food- 
pointing 


Period 


Gool Forward 


Goal 
Gradient 


Gradient 


pointing 


SEMVauhUn = 


1 From Wherry (1941, p, 239), 


tification of the factors, The loadings for the three factors over 


Phed in Figure 10.2. 

ctor is interpreted as “. . . when an ani- 
n a given direction it will, if forced to 
ion, resume it u 


The food-pointing factor is interpreted as “. . . an animal 


2R. C. Tryon, “Studies in Individual Differences in Maze Ability. VII. Specific 
Components of Maze Ability and a General Th 


cory of Psychological Components,” 
J. Comp. Psychol., 1940, 80, pp. 283-335, k ° s 
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The goal-gradient factor is defined as “. . . alleys nearer the 
goal will be learned more quickly than those further away” 
(Wherry, 1941, p. 245). 

Inspection of Figure 10.2 indicates that, although the trends 
of the loadings of the factors over the entire learning period are 
similar for the two groups of animals, there are differences in the 
sizes of the factor loadings at given periods.* Thus, for the bright 
animals there is an earlier, steeper rise for the goal gradient and 


Bright Rats Dull Rats 
—— Loo ———=— 
Goal Gradient 


1.00 


Food Pointing 


-80 
a n 
£ = 60 
D 5 
3 š 
S 3 
s go ee 
E 3 ; Forward Goin 
È È 20// 9 


00 
TAs HFG 7S 910 
Learning Periods 
(6) 


00 


12345678910 
Learning Periods 
(a) 


10.2 Profiles of factors obtained by Wherry from Tryon’s maze data (see 
Table 10.3). (From Royce, 1950b, pp. 244-245.) 


FIGURE 


it remains the dominant factor in the middle and later periods. 
For the dull group, food-pointing is the dominant factor with 
goal gradient rising only toward the end. ‘There also seems to 
be a tendency for the bright rats to make more use of the forward- 
going factor in the later periods. If forward-goingness can be 
considered indicative of spatially-oriented behavior and goal 
gradient of insightful solution of the problem, the results can 
be interpreted to indicate that the genetically brighter rats are 
making greater use of “internally-released” hypotheses, whereas 
the duller rats are more dependent on external stimuli, such as 
food, for their learning hypotheses. It was also demonstrated in 
the study that the three factors isolated by Wherry could be used 
to predict the observed values as accurately as the ten components 


that Tryon considered necessary for this purpose. 


and similar trends over the learning periods were verified 
Cf. Wherry (1939) and Wherry (1940). 


3 The same factors 
on four other sets of animal data. 
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An ANALYSIS OF CRITERION MEASURES 


An attempt was made by Fruchter (1952b) to 
analyze the factor content of Air Force training criteria. If a 
criterion measure is treated as a variable and its correlations 
(validities) with the other variables in a prediction battery are 
obtained, it can be analyzed along with the other variables. This 
results in a description of the criterion in terms of the common- 
factor variance. (See Guilford, 1948b.) Such a description may 
be especially useful for understanding the nature and relative 
importance of the abilities called for by the criterion and for 
comparing profiles of abilities required for different courses of 
training. Thus, it might be found that occupations which are 
quite different at the level of skilled performance have a similar 
profile of aptitude requirements at the level of training. 

A battery of classification tests is routinely administered to 
airmen in the USAF Air Training Command, and the results 
are used to assign them to technical schools for training in various 
occupational specialties. To evaluate the effectiveness of the tests 
in selecting successful candidates for the training schools, the 
scores are validated against the final grades made in the technical 
courses. 

The following two alternative approaches might be used to 
determine the factor content of these final-grade criteria. 


1. Consider each technical specialty a separate population and 
perform a factor analysis of the intercorrelations of its criterion 
and the test battery, or, 

2. Consider the airmen a single population and perform a 
factor analysis of the battery on a representative sample, Then 
estimate the factor content of each training criterion on the basis 


of the factor loadings obtained from the combined population and 
the validity of the tests. 


The latter approach was adopted because the loadings from 
specialty to specialty would have a more common basis of com- 
parison and a prohibitive number of factorial analyses would not 
be required. 

A representative sample of 389 airmen who had been assigned 
to training in technical schools was drawn. Each school was 
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represented in the sample in proportion to the total number of 
airmen who had been sent into that type of training, and the indi- 
viduals, all male and mostly 18 and 19 years of age, chosen to 
represent each school were randomly selected. 

The rotated, orthogonal loadings of the nineteen tests used 
to classify the airmen are shown in Table 10.4. The six factors 


TABLE 10.4. ROTATED ORTHOGONAL FACTOR LOADINGS 
FOR STRATIFIED SAMPLE OF TECHNICAL SCHOOL 
STUDENTS? 


Sample: 389 Airmen Assigned to Technical Schools (Decimal points 
omitted throughout) 


Tests” ] 


41 62 12 14 23 06 
72 15 17 04 22 05 
59 31 17 00 22 38| 67 
07 26-02 33 23 48| 47 
14 18 00 27 64 16] 56 
01 -09 56 07 -07 -05 | 33 
25 25 16 39 12 32] 42 
22 74 O1 19 16 13] 67 
-01 15 53 22 10 16 | 39 
57 43 17-02 26 31| 70 
13 71 -05 20 36 08| 70 
49 32 19 38 19 25] 62 
29 69 05 10 28 23] 70 
06 53 04 18 59 19] 70 
o2 10 02 07 38 47] 38 
70 04 17 36 08 -18 | 69 
78 11 07 28 08 03] 71 
01 42 08 00 63 14 |60 
08 18 08 54 26 16/43 
N Vv SE P ME Vz 


+ Reading Vocabulary s. s.. 
. Arithmetic Computations 
. Arithmetic Reasoning .. 
+ Pattern Analysis... ++ 
- Mechanical Aptitude -2.. 
. Biographical Inventory #1 
. Memory for Landmarks . . 

- Background for Current Aff 
. Biographical Inventory #2 .. 
10. Arithmetic Reasoning .. 
11. Aviation Information. .« 
12. Dial and Table Reading. 
13. Reading Comprehension . 
14. Electrical Information . « 
15. Mechanical Principles . 
16. Numerical Operations #1. 
17. Numerical Operations # 2 
18. General Mechanics «+++ 
19. Speed of Identification s. sss essers 


ODIYOUVAWN = 


1 From Fruchter (1952b). 
2 Variables 1 through 4 are part scores o! 
test. Variables 6 through 19 are tests from 


f the AGCT. Variable 5 is also an AG 
the Airman Classification Battery. 


were identified as follows: I, numerical facility (N); II, verbal 
comprehension (V); HI, secio-economic background (SE); IV, per- 
ceptual speed (P); V, mechanical experience (ME); VI, visualiza- 
tion (Vz). 

The validity coefficients of the nineteen tests for the fourteen 
technical-school, final-grade criteria were obtained. The factor 
loadings of the final-grade criteria were computed by a method 
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for estimating the factor loadings of a variable not included in 
the original analysis but whose correlations with the other vari- 
ables in the battery are known (see Appendix). Since the airmen 
had been classified on the basis of the test-battery scores, there was 
considerable restriction of the assigned groups on the more valid 
tests. This tended to reduce the validity coefficients and the factor 
loadings estimated from them. The amount of restriction was 
estimated from the standard deviations of the scores of the airmen 
assigned to the various specialties and the estimated factor loadings 


of the criteria shown in Table 10.5 were adjusted to compensate 
for the restriction. 


TABLE 10.5. ESTIMATED FACTOR CONTENT OF TECHNI- 
CAL SPECIALTY TRAINING CRITERIA CORRECTED FOR 
RESTRICTED VARIABILITY? 


Sample: Graduates of the Technical Specialty Schools (Decimal points 
omitted throughout) 


Criterion Group 


Weather Observer 


64 
A &E Mechanic (Jet) 72 
Carpetiter: cs. stes 14 
Control Tower Operator 40 
Draftsman ss siasa 53 
Electrician... 2... 64 
Engineman Operator . . 26 
Clerk-typist.... . a 
Medicol Corpsman... seen... 47 
Radar Mechanic, General 87. 


1 From Fruchter (1952b), 


It will be observed that the criteria have a considerable range 
of communalities. The very low communalities (e.g., for car- 
penter, radio operator ACS, and engineman operator) are prob- 
ably due to the unreliability of these criteria, although no direct 
estimates of their reliabilities are available. It would be desirable 
to determine the reliability of the grading systems used in these 
schools. If the reliabilities should prove to be satisfactorily high, 
then, of course, the indication would be that there is considerable 
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specific variance in the criteria and that they are not sufficiently 
covered by the tests in the battery. 

It should be kept in mind in evaluating the loadings that the 
criteria are grades in a course of study rather than performance on 
the job. The two highest loadings for airplane-and-engine me- 
chanic (conventional) are on the mechanical-experience and vis- 
ualization factors; for the jet mechanic they are on the mechan- 
ical-experience and verbal factors. The curriculum and grading 
system of the latter school might be examined to determine 
whether the stress on verbal material is necessary or even desir- 
able. If a similar amount of verbal ability is not needed at the 
level of skilled performance, a number of airmen who might make 
skilled jet mechanics are not being sent into training because of 
the stress on verbal comprehension in the training curriculum. 
This type of loss of potentially useful workers can become a serious 
problem in situations where the supply of talent is small relative 
to the demand (see Dailey, 1952). 
_ The aircraft sheet-metal worker grades have their highest load- 
ings on visualization with a smaller loading on mechanical ex- 
perience, whereas the draftsman grades have their highest loading 
on mechanical experience with a somewhat lower loading on 
Visualization. These loadings might also serve as a basis for eval- 
Uating these two curricula, as more visualization variance might 
Well be expected in drafting grades and more mechanical experi- 
ence variance in a sheet-metal course. The high numerical load- 
ing for the clerk-typist criterion is also somewhat surprising. A 


relatively higher verbal loading was anticipated. 
The profiles of factor loadings might also be compared to 
determine the degree of similarity or difference among the criteria, 


and clusters of training specialties could be set up as a basis for 
establishing job families. 


HE STUDY OF 


Direcr ROTATION TO PRIMARY STRUCTURE IN T. 
IN STUDENT- 


PERsonAaLrry Factors RELATED TO SUCCESS 
TEACHING 

; Schmid (1950) sought to determine the relation- 
ships of such personality measures as the Washburne Social Ad- 
Justment Inventory, Mooney Problem Check List, and selected 
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scales of the Minnesota Multiphasic Personality Inventory, to 
one another and to measures of teaching aptitude, such as total 
and professional grade-point average, attitude toward teaching, 
judgment in selecting sound teaching techniques under special 
conditions, and a rating by critic-teachers of proficiency in prac- 
tice-teaching. The twenty-four scores listed in Table 10.6 were 
obtained on fifty-one male and fifty-one female students enrolled 
in a university school of education. The male and female sam- 
ples were treated separately for intercorrelational and factorial 
analysis. Since not all of the scores were available for all indi- 
viduals, the N’s for the correlations, computed from normalized 
distributions, varied. 

The analysis of the correlations derived from the female 
sample was made by a modified form of the multiple-group 
method (see Chapter 6) which consisted of the following steps: 


l. Estimation of the communialities, 


2. Determination of the number of common factors. 
3. Construction of an arbitrary, initial oblique structure. 


4. Rotation of this initial structure to a primary-axes solution. 


The communalities were estimated by the centroid formula * 
and the variables arbitrarily assigned to three groups in order to 
test the hypothesis that the rank of the correlation matrix is three. 
In grouping the tests, care was taken to obtain one axis on which 
all the variables would have positive projections, so that it could 
be used as a central axis for obtaining the extended-vector load- 
ings of the variables. The loadings on the oblique multiple- 
group axes were calculated and, also, the correlations among the 
oblique axes (C-matrix).* A test of the linear independence of 
the oblique axes was made by attempting to calculate the inverse 
of the C-matrix (i.e., C~*). If an inverse matrix cannot be com- 
puted (because some of the pivotal values become negative) or if 
some of the entries in the inverse matrix are greater than 1.0, the 
indication is that the factors are linearly dependent. This usually 
results where too many groups (factors) have been assumed rela- 
tive to the rank of the correlation matrix. A satisfactory inverse 


4See p. 88. 
5 See p. 147. 


Applications in the Literature 167 


matrix, however, was found for the three groups referred to above, 
and the projections on the three axes were extended and plotted. 
The plot revealed no clear-cut configuration for three factors so 
the process was repeated for four factors. The plots for the ex- 
tended-vector loadings for four factors seemed to indicate the 
possibility of five factors. However, the inverse of the C-matrix 
for five factors contained many entries greater than 1.0, and the 
plots seemed to indicate the presence of only four factors; so the 
hypothesis of five common factors was discarded and the hypothe- 
sis of four common factors accepted. 

The variables were formed, on the basis of the extended-vector 
plots, into four groups and the factoring process repeated. The 
inverse matrix (C-t) was computed and did not reject the hy- 
pothesis of four linearly independent groups. Examination of 
the three extended-vector plots, needed to represent the four fac- 
tors, indicated that the intersections of the hyperplanes, where 
the primary axes are located were close to variables 9, 10, 13, 
and 20. The loadings (structure values) of the variables on the 
primary axes were obtained by direct rotation to primary, oblique 
structure (cf, Chapter 8 and Harris and Schmid, 1950), Further 
support for the hypothesis of four factors was obtained by examin- 
ing the residual correlations. The standard deviation of the dis- 
tribution of residuals was found to be less than the standard error 
of a correlation coefficient of zero. 

The major loadings of the variables on the four factors are 
shown in Table 10.6. The factors were identified as: 


I. Problem-response Set 

II. Professional Maturity 

III. Introversion 

IV. Social Adjustment 

The highest loading of the Practice-Teaching Rating is on 
the introversion factor, the direction of the measurement being 
such that a low rating is associated with introversion. The load- 
ings on the professional-maturity factor indicate that grades and 
the professional-teaching judgment of female students are re- 
lated to age and interest patterns characteristic of women; female 
students tend to reflect the same attitudes toward teaching as they 
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TABLE 10.6. MAJOR LOADINGS ON OBLIQUE PRIMARY 
AXES IN SCHMID’S STUDY? 


r Female Sample Male Sample 
Variable T Il]mlwy I I 

1. Washburne Control ... 
2. Washburne Purpose , 65] 62 
3. Washburne Sympathy... .......002. .46 50 
4. Washburne Alienation ......,,.., Al 62 
5. Washburne Impulse~Judgment......, P -40 
6. Washburne Subtotal Score... 2... LL, -90| 1.00 
7. Woshburne Yes—response.........,.,).43 37 
8 .?Professional Grade—Point Average... a š 54 36 
9. Total Grade-Point Average .. 2... 3 .32 
10.2Best's Teacher Attitude Test . US tas tai see 55] 43 
MAS AG 6 a 3 5 3 ae TE hh eso Ge ek ee co .43 -33 
12. Mooney Finances, Living Conditions and Employment . | .62 44 
13. Mooney Social and Recreational Activities eee | 78 +48 
14, Mooney Social- psychological Relations... , . | 78 -85 
15. Moon-_ --al-psychological Relations . . |. | 52 .96 
16. Mooney Adjustment to College Work |... 2 | 66 53 
17. Mooney The Future: Vocational and Educational 59 66 
18. Practice Teacher RENOD s a wie us ar ea .49 
192Minnesota Multiphasic Inventory Hypomania . , 37 
20. Minnesota Multiphasic Inventory Masculinity~ , 

Femininity Interest... 2. aau, a Tavs 48 
21, Minnesota Multiphasic Inventory Depression . .97 -30 
22. Minnesota Multiphasic Inventory Social . , 

Introversion-Extroversion... 2... eka 68 a4 
23. Wright Professional Judgement Examination ETF -47 
24: Mooney Total Number of Problems . tee ee | 89 [ee ES 


1From Schmid (1950). 
2 Reflected. 


have toward society in general. 


It was also observed that the sub- 
scores of each personalit 


y inventory tend to be separated, facto- 
rially, by inventory rather than yielding factors that cut across 
personality areas. The correlations among the primary factors 
(not shown in this summary) indicated no relationship among the 
problem-response-set, professional-maturity, and social-adjustment 
factors. There is a negative relationship between the response-set 
and introversion factors, Professional maturity is independent of 
introversion and negatively related to social adjustment, whereas 
(poor) social adjustment was highly correlated with the intro- 
version factor. 

A procedure similar to the one outlined above was followed in 
analyzing the correlations derived from the male sample. Two 


~h 
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nearly orthogonal factors fitted the data best, and gave small 
residual correlations. The two factors, whose major loadings are 
shown in Table 10.6, were identified as follows: 


I. Social and Educational Adjustment 
II. Personality-psychological Factor 


Factor I was interpreted as indicating that general social ad- 
justment and professional attitude and achievement are related 
factorially for male students. Apparently a male student who 
gets good grades in education and general courses is likely to 
possess a favorable attitude toward teaching and to be rated high 
in teaching skill. The variables loaded on the second factor are 
all measures of personality or are psychological in nature. 

The differences in the results of the two analyses were attrib- 
uted largely to sex differences, although instability due to small 
N's may be partly responsible. In interpreting the results of this 
study it should also be borne in mind that not all of the variables 
are experimentally or linearly independent (see Guilford, 1952a). 
Several of the sub-scores of the personality inventories have items 
keyed in common, and the correlation of sub-scores with total 
score leads to spuriously high part-whole correlation which may 
result in spuriously high loadings and communalities for the 


variables involved. 


RELATIVE ROLES OF CENTRAL AND PERIPHERAL FACTORS IN 
Prrcu DISCRIMINATION 

In an attempt to establish the relative roles of 
peripheral and central factors in pitch variation, Jeffress (1948b) 
has advanced a theory of pitch perception which hypothesizes a 
mechanism in the central nervous system that is activated only 
when both ears are stimulated simultaneously. Tones heard 
through one ear only are believed to be represented centrally 
through an entirely different set of pathways, one for each ear. 
When both ears are stimulated, there is, therefore, a threefold 
representation of pitch, by way of the pathways for the left and 
right ears separately, and by way of the central mechanism which 
responds to the simultaneous stimulation of both ears. If all of 


a 
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these mechanisms exist, the pitch of a tone heard through both 
ears should be some sort of average of these three heard pitches. 

It had been observed previously that there are day-to-day 
changes in the tuning of each ear when stimulated alone. ‘Thus 
if one ear were to rise in pitch while the other car remained the 
same, the tones heard through both ears simultaneously would 
be expected to rise also, although not as much, because the tone 
heard is expected to be an average of the three pitches. 

In order to test the expected outcomes of this theory, Jeffress 


correlated day-to-day changes in pitch discrimination for the fol- 
lowing three conditions: 


1. Monotic. Stimulation of one ear alone. 

2. Diotic. Simultaneous stimulation of both ears by the same 
stimulus (pure tones of the same frequency, intensity, and phase). 

3. Dichotic. Simultaneous stimulation of both ears by differ- 


ent stimuli (pure tones of the same frequency and intensity but 
of different phase). 


Table 10.7 shows the rank- 


š . 3, r 
difference correlation coefficients fo 
the following comparisons: 


l. Monotic-diotic: The standard tone to the left ear, the vari- 
able tone to be matched to it in both ears (Ls Dv). ? 

2. Monotic-diotic: The standard tone to the right ear, the vart- 
able tone to be matched to it in both ears (Rs Dv). ; 

3. Dichotic-diotic: Standard to both ears with the left leading 


» variable tone to be matched to both ears in phase 


5. Monotic-monotic: Aver 
rately (LR Av). 

6. Dichotic-dichotic: Avera 
half of the trials and the ri 
half (LI RI Av). 


age of the two ears stimulated sepa 


ge of the left ear leading in phas¢ 
ght ear leading in phase in the other 


The correlation coefficient: 


S were factored by the centroid 
method and the two factors we 


re rotated in an attempt to separate 
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central and peripheral factors. The rotated loadings are also 
shown in Table 10.7 and are graphed in Figure 10.3. Since a 
change in the tuning of one ear would be expected to influence 
all tones with which it was averaged, a factor with loadings of 


TABLE 10.7. CORRELATION AND ROTATED ORTHOG- 
ONAL FACTOR MATRIX FOR DAY-TO-DAY VARIATIONS 


IN PITCH? 
3 Rotated Factor 
Correlations Loadings 

Variable 4.5 E ma 
Ls Dv -=50 39 -.57 —=37 -.07 wh <2 33 
Rs Dv =.36 .28 .61 -.16 |-.8 2 -68 
Lis Ov =55 =.3t ~ 6 4 “2 
Rls Ov 07 10 —6 s? -85 
LR Av -25 |-6 4 352 
A) 8. 210. 


LIRLAvY 


1From Jeffress (1948b, p. 479). 


the same sign for the monotic-monotic (variable 5) and monotic- 
diotic comparisons (variable 1 and variable 2) would be expected. 
Factor II shows this expected relationship, since the loading of 
the monotic average has the same "e 

sign as the diotic comparisons. 

The effect on the dichotic com- 

parisons is not so obvious. Factor 

I is interpreted as also represent- 

ing the effect of peripheral change. i 
It shows a tendency for changes 

in the standard to the left ear and 

the left-leading dichotic compari- 

sons to go together and in opposi- 

tion to the standard to the right 0 

ear and right-leading dichotic Ficure 10.3 Plot of rotated loadings 
changes. If a change in one of for ons rate 10). in (pitch 
the ears (say the left one) affects 


diotic and dichotic comparisons equally, ve no 
on their relative positions on the factor. If, however, it raises the 


left-leading dichotic tone more than the diotic, and diotic more 
than the right-leading dichotic tone, all loadings on the factor 


it should have no affect 
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would be expected to be positive. The loadings on factor I seem 
to indicate that a rise in pitch of the left ear has the greatest effect 
on the right-leading, next to the left-leading, and least on the 
diotic tone. 

Is there any evidence of a central factor? Such a factor should 
reduce the magnitude of all Pitch differences. Such a tendency 
would result in a factor having loadings with the same sign for 
the monotic-diotic comparison with the standard applied to the 
left ear, and loadings of the opposite sign for the changes in the 


other comparisons. No rotated position of the axes revealed such 
a factor. 


Another possibility, sugge: 
found between the monotic t 
that the two types of tones ha 


sted by the large disparity commonly 
ones and the other tones, however, is 
ve become separated. If all the fibers 
were to change their pitch-assignment by the same amount, non- 
zero Opposite-sign loadings for the diotic-comparison changes and 
zero loadings for the other variables would be expected. If, how- 
ever, the fiber-assignments for the diotic tones were to shift more 
than those for the dichotic tones, but the latter were to change 
in the same direction as the former, the diotic changes would have 
higher loadings and the left standard and left-leading comparisons 
would have one sign while the right-standard and right-leading 
comparisons would have the Opposite sion, That is approximately 
the way in which factor I behaves, except for the loadings of the 


aller than anticipated. There is then 
nce for a possible central-mechanism 


SECOND-ORDER ANALYSIS OF REASONIN 


G TESTS 


British factor analysts who follow the lead of Spear- 
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TABLE 10.8. FIRST-ORDER AND SECOND-ORDER FACTOR 
LOADINGS FROM RIMOLDI’S STUDY OF REASONING 
TESTS? 


Second-order 


Rotated Oblique Factor | Factor Loadings 


Tests 


1 | Geometrical Forms No.1. . m05 

2 | Geometrical Forms No2. « 02 BDR |e 
3 | Number Series 2... 0 + .04 55 we Be 
4 | Verbal Analogies M r Eg 16 
5 | Pedigrees 02 44 |-07 |-.32 
6 | Inventive Synonyms . 02 33 |-10 -04 
7 | Groupof Figures s... 03 45 | 24| 13 
8 | Classification of Figures .. 10 64 ‘0s l-07 
9 | Arithmetical Reasoning... 02 49 ‘08 o6 


10 | Absurdities 
11 | Group of Letters 
12) Code a 
13 | Secret Writing 
14 | High Numbers 
15 | Numerical Judgmen’ 


16 | Three Higher. . N 14 62 | .02 | .00 
17 | Letter Series . ; .09 51 |-.07 | .05 
18 | Directions code 01 28 | 15 |-02 
TO ARGOS: sacra 9 serie) 0 © yen 51 


20| Form or Pattern Analogies. 
21 | Number Pattern»... += 
22 | Inventive Opposites ... + 
23) Reasoning»... ++ ++ = 
24] Reasoning and Inferences No.1 
25| Reasoning and Inferences No.2 | .07 |.01 


1 From Rimoldi (1951, pp. 95-99)- 


is comparable to the general-intellective factor. To test this hy- 


pothesis Rimoldi (1951) administered a battery of twenty-five rea- 
soning tests to 384 children between the ages of eleven and four- 
teen from twelve schools of a large city in Argentina. The inter- 
correlations of the variables were analyzed by both the multiple- 
group and two-factor methods. The loadings on both the general 
factor and seven rotated oblique factors are shown in Table 10.8. 
The primary factors were identified as follows: 


I. Bringing together of several conflicting gestalts (plasticity) 
II. Not interpreted (but possibly Number) 

II. Finding relationships 

IV. Finding likenesses 
V. Eduction of correlates 


VI. Verbality 


VII. Globalization (bringing parts together for a solution) 
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The intercorrelations of the seven primary factors are shown 
in Table 10.9 and were analyzed to three centroid factors. The 


TABLE 10.9. DATA FOR SECOND-ORDER ANALYSIS 1 


Correlations among the Primary Factors 


First-order Factor 
I | Plasticity in Reasoning, . 
I | Unidentified. n... .40 
pig Finding Relationships Pem 32 
I | Finding Likenesses , Pi =14 
Y | Eduction of Correlates |. 4 
oC -47 
XI | Globalization...) | 1.00 


Loadings of the Primary Factors 
on the Rotated Oblique Second-order Axes 


Correlations among the Second-order Factors 


reasoning tests is essential] 
tained by applying Spearman’s methods, 
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factors are needed to explain the intercorrelations of the primaries. 
The second-order centroid factors (not shown in the tables) were 
then rotated and the loadings of the first-order primary factors on 
the rotated, oblique, second-order factors, as well as the correla- 
tions among the second-order factors, are shown in Table 10.9. 

Primary factor IV has the highest loading on second-order 
factor A, which is orthogonal to the other second-order factors. 


Primary factors I, III, and V 
show smaller loadings on this 
factor. Finding likenesses and 
Opposites seem to characterize 
the element common to the 
primaries having loadings on 
this second-order factor. 
Primary factors III, V, and 
VII are loaded on second-or- 
der factor B. They represent 
finding relationships, educa- 
tion of correlates, and global- 
ization of parts; and second- Ficure 10.4 Plot of g-loadings and load- 
order factor B was interpreted "8° oe After Rimoldi 1961; pe rie 


as referring “to the mecha- 
nisms involved in dealing with relations and the education of 


Loadings on unrotated Factor -A 


O .20 .40 .60 80° 1.00 
g-Loading 


correlates.” 
Primary factors VII, VI, II, and I are loaded on second-order 


factor C. The common element here seems to be the synthesizing 
of parts into a solution, and the factor was therefore interpreted 
as representing the “synthetic process, including here all the pos- 
sible reshaping and redistribution of the elements and the use 
of the ‘instruments’ by means of which this process is accom- 
plished.” 

The results seem to support the hypothesis that the g-factor is 
nonunitary since six of the seven first-order factors have appre- 
ciable loadings on it. Also, the general factor can be derived 
either directly from the variables or through the intermediate step 
of obtaining the correlations among the first-order oblique factors 
and then factoring them. It seems to be indicated by this and 
other studies that the level at which a factor is brought out de- 
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pends on the complexity of the variables and the method of anal- 
ysis. Factors which are brought out in the second, or a higher 
order in one analysis may be brought out in the first-order in 
another analysis by the appropriate selection of variables. 


OBVERSE ANALYSIS OF SUPREME Court Vorinc RECORDS 


By far the most frequent type of correlational 
study has been the correlation of a series of measures over a pop- 
ulation of persons (referred to as R-technique). There are, how- 
ever, a number of other possibilities such as the correlation of a 
series of measures applied to one person over a population of 
occasions (P-technique) and the correlation of a series of persons 
over a population of measures (Qtechnique).° Analysis of a 
matrix of Q-correlation coefficients has been variously referred 
to as inverted, inverse, or obverse factor analysis 
tests and persons are interchanged as 
conventional R-technique. 

An interestin 
to determine wh 


, since the roles of 
compared with the more 


§ application of obverse factor analysis was made 
ether the voting records of the nine U. S. Supreme 
Court justices on a sample of 115 cases decided during 1943-1944 
sessions gave evidence of any groupings of the Supreme Court 
justices. (Thurstone and Degan, 1951.) Only those cases in 
which there were at least two dissenting Opinions were included. 
Intra-class phi coefficients were computed between each pair of 
judges on the basis of agreement with, or dissent from, the major- 
ity decision. A Positive coefficient indicates that two judges tend 


to agree in their votes. A negative coefficient indicates that they 
tend to disagree. 


Three centroid factors 


were extracted from the correlations 
and the communality for 


each judge was computed (see Table 
10.10). There was a large range of communalities, and the voting 


of those judges with the highest communalities could be predicted 
best from the three common factors. 


The oblique rotations ar 


®See pp. 202-204, 
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TABLE 10.10. FACTOR LOADINGS OF VOTING RECORDS 
OF NINE SUPREME COURT JUSTICES DURING 1943-1944 
(N = 115 DECISIONS) 


Rotated Oblique 
Factor Loadings 


75 -63 


Centroid 
Factor Loadings 


Justice 
Black 


1 

2] Douglas 

| 3| Rutledge =M 
4| Murphy =39 
5| Reed -02 
6| Jackson -64 
7| Stone -.04 
8 | Frankfurter .27 
9 | Roberts M2 


1 From Thurstone and Degan (1951, p. 5). 


of Justices Douglas, Black, Roberts, and Frankfurter. In Figure 
10.5b the trace of the plane orthogonal to factor HI has been 
moved into the position determined by the vectors representing 
Justices Douglas, Black, Reed, and Roberts. In Figure 10.5¢ 


Jackson 
Fronkfurter 


„Roberts 


Sheed 
“Douglas. Rutledge 
Block™ 

Rutledge 


Murphy 


Murphy 


Ficure 10.5 Plots of rotated oblique factor loadings from Table 10.10. (After 
Thurstone and Degan, 1951, pp. 6-7.) 


there is a further adjustment of factor II relative to factor II. 
In Table 10.10 the oblique factor loadings (V-matrix) are given, 
and the transformation matrix (A) and cosines of angles between 
the oblique axes (C-matrix) are shown in Table 10.11. The 
authors point out that this is not the only possible solution and 
that other locations of the reference frame could be justified. 
As is true in all areas of investigation, interpretations cannot 
be made automatically. An expert in the subject-matter field is 
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ABLE 10.11. TRANSFORMATION MATRIX (A) AND CO- 
= SINES OF ANGLES BETWEEN REFERENCE AXES (C) + 


1 From Thurstone and Degan (1951, p. 5). 


needed to interpret the results, Factor I represents something 
which is shared in common by Justices Black and Douglas and 
which is the opposite of something shared by Justices Frankfurter 
and Roberts. Factor II represents something common to the 
points of view of Justices Reed and Stone but Opposite to what a8 
common to Justices Murphy and Rutledge, Justice Jackson 1s 
not represented on factors I and II. 

Justice Jackson is strongly represented on factor III and Jus- 
tice Frankfurter to a lesser extent, while the points of view of 
Justices Rutledge and Murphy have negative projections on this 
factor. The entire configuration of test vectors as viewed in three 
dimension also seems to suggest that there are actually 5 distinct 
but not unrelated points of view represented. These are (1) Reed 
and Stone, (2) Murphy and Rutledge, (3) Black and Douglas, (4) 
Frankfurter and Roberts, and (5) Jackson and, to a lesser degree, 
Frankfurter. 

Extracting the factors from a correlation matrix is a relatively 
straightforward matter, Rotating to a meaningful position and 
interpreting the nature of the factors, however, requires consider- 
able knowledge and insight in the field under investigation, as has 
been mentioned above. The authors, therefore, discussed theit 
results with specialists in political science and law. There was no 
general agreement concerning the interpretation of the factors, 
although the authors report they were surprised to observe that 
the discussions centered around the personalities of the judges 
rather than around political, €conomic, or legalistic concepts such 


as civil liberties, economic liberalism, or the rights of the indi- 
vidual. 
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Another method of interpreting the factors in this type of 
study might be to list the decisions on which all of the judges on 
a factor agreed or disagreed and to interpret the factor on the 
basis of the common principle which the agreements or disagree- 
ments seem to embody. Also to be considered in the interpreta- 
tion are the possible divergent reasons for dissenting to a majority 
decision and the relative frequency of each type of case considered. 
The authors felt that this type of interpretation should be done by 
specialists in the field such as lawyers or political scientists. Pend- 
ing such further study, no final interpretation of the nature of the 
factors was attempted. 


Osversr FACTOR ANALYSIS OF PREPSYCHOTIC PERSONALITY 
TRAITS 

Another application of obverse factor analysis has 
been made in the study of prepsychotic personality traits in pre- 
sumably normal female subjects. (Moore, Stafford, and Hsu, 
1947.) A personality inventory, consisting of 128 questions con- 
cerning prepsychotic and normal mental hygiene traits, was ad- 
ministered to fifty-six juniors and seniors in a Catholic woman’s 
college. The subjects were instructed to check those items to 
which they would answer “yes.” Thirty-eight of the items were 
checked by more than fifty or fewer than six girls and were, there- 
fore, considered to be nondifferentiating. Tetrachoric correla- 
tion coefficients were computed between every pair of girls on the 
basis of their replies to the remaining ninety items. Thus, since 
the variables are persons and the sample consists of the tests (i.e., 
items), a 56 x 56 matrix of correlation coefficients was obtained, 
each coefficient based on a sample of ninety traits.’ Eleven factors 
were extracted by the centroid method and rotated to simple, 
oblique structure. ‘The factors were identified by studying the 
of a person to her loading on a factor. 
igh loadings on a factor gave the same 
sed to help interpret 


relationship of the answers 
If all of the persons with h 
answer to a question, that question was u 


— 
_ 7 Q-technique. Correla! 
tions and factor analyzed. 
to say that the authors consi 
from the ninety-item study. 


from the replies to all 128 ques- 
ill not be reported here except 
analysis less satisfactory than 


tions were also computed 
This aspect of the study w. 
dered the results from that 
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the nature of the factor. The following two questions were ai 
swered affirmatively by all thirteen girls who had high loadings 
on the first factor: 


Have you carried out to some significant degree your 
first seriously adopted plan of life? . 

Are you dominantly interested in things going on 
around you in the world in which you live? 


The first factor is identified as “normal, healthy type of ad- 
justment.” 


Traits held in common by the twelve girls with loadings on 


the second factor are indicated by affirmative answers to the fol- 
lowing questions: 


Do you prefer tasks that 
to work at? 

Are you dominantly interested in things going on 
around you in the world in which you live? 

Are your plans and Wishes mainly concerned with find- 
ing someone whom you can love with all your heart? 

Are your plans and Wishes mainly concerned with find- 


ing someone who will love you with deep, abiding devo- 
tion? 


are clearly outlined and easy 


The second factor is identified 
mantic type of adjustment. 


: ; ; B . e 
Traits held in common by the seven girls with loadings on th 


x * a . 4 ing 
third factor are “self-consciousness” and accusing others of doing 
some of the following things: 


as a normal, extraverted, ro- 


- ++ being mean to others and to her; 


interpreting loosely 
moral conduct; 


lacking frankness; cheating on exam- 
ring; being immoral, lying; purposely flat- 
; being disloyal or unfaithful; purposely plac- 

Trassing situations; sponging on others; 
cheating; being insincere; being ungrateful and selfish; 
being thoughtless; being conceited: separating friends; 
being arrogant; failing to appreciate qualities in herself or 
in others; being narrow, Stupid, provincial in outlook; 
trying to bully others; trying to “crush intelligence with 


organized stupidity”; gossiping and slandering; having ul- 
terior motives; being hypocritical, 
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The third factor is identified as a social type of maladjustment 
with ideas of reference, projection, and self-consciousness, and 
was labeled by the authors as an “inferiority factor.” 

Traits held in common by the two girls with high loadings on 
the fourth factor are as follows: 


Has carried out to some significant degree her first 
seriously adopted plan of life; life and actions are guided 
at present by a well-defined purpose; prefers tasks that are 
clearly outlined and easy to work at; prefers tasks in which 
one is very likely to succeed; music, literature, and art afford 
her passive enjoyment rather than active accomplishment; 
if a person is widely admired, she reserves her own judg- 
ment until she knows more about him. Is dominantly in- 
terested in things going on around her in the world in 
which she is living; plans and wishes mainly concerned 
with finding someone who will love her with deep, abiding 
devotion; plans and wishes concerned mainly with finding 
someone whom she can love with all her heart; her fear of 
doing wrong is stronger than her fear of losing love or not 
being loved; has accused others of lying, stealing, cheating, 
being sneaky or not being righteous; wants to do something 
of distinct value for human beings or for the welfare of her 
own country; wants to do something that will contribute 
to God's plan for the Universe; has peculiar abnormal 
fears; is inclined to become excited over what others would 


consider trifles; is self-conscious 


The fourth factor was not considered as clear-cut as the first 
three but was tentatively identified as “simple-maladjustment.” 
It seems to present a fairly good picture of an introverted, sensitive 
type of personality maladjustment with perhaps some underlying 
conflict between principles and desires. It is, of course, only a 


doublet.s 
The four girls w 
following traits in common as revea 


questionnaire: 


— 
8A factor with only two variables with high loadings on it is referred to asa 
“doublet,” a factor with three high loadings on it, as a “triplet. 


ith high loadings on the fifth factor had the 
led by their answers to the 
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Plans and wishes mainly concerned with finding some- 
one she can love with all her heart; plans and wishes mainly 
concerned with finding someone who will love her with 
deep, abiding devotion; has vigorous type of personality 
entering the fight rather than seeking shelter; first thought 
seriously about what she was going to do in life at 13 or 
older; has entertained three or more plans of life seriously 
since her 15th year; lacks self-confidence; likes to be with 
people constantly rather than apart by herself. 


The fifth factor was identified as a romantic-extroverted type 
of adjustment. 

There were only two traits common to the seven girls who 
had high loadings on the sixth factor, namely: “plans and wishes 
mainly concerned with finding someone who will love her with 
deep, abiding devotion” and “ready to take vigorous steps to 
carry out her own plans.” The interpretation of this factor is 
not very clear, but, when traits which are claimed by six of the 
seven girls are also considered, a 
justment seems to be indicated. 


The three girls having high loadings on the seventh factor had 
the following traits in common: 


frustrated-romantic type of ad- 


Prefers music, literature, and art to less aesthetic pleas- 
ures; is self-reliant rather than dependent on aid and com- 
fort received from others; is ready to take vigorous steps 
to carry out her own plans; first thought seriously about 
what she was going to do in life at 13 or older; began to 
plan and work to attain her ambition at 16 or older; has 
entertained more than three plans of life seriously since 
her 15th year; wants to do something that will contribute 
to God’s plan for the Universe; is easily influenced; given 
to tears with glistening eyes; is self-conscious; likes to study; 
likes to be alone. . 

The seventh factor seems to re 
type of adjustment and is identifie 
type” of person. 

The eighth factor is another doublet, ‘The traits common to 


the two girls with high loadings on it are indicated by the follow- 
ing: 


present a thinking, introverted 
d by the authors as a “schizoid 
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Has a tired, worn-out attitude; suspects others of evil 
intentions; is unscrupulous; speaks of having vivid dreams; 
has had nightmares; given to tears with glistening eyes 
and tears rolling; usually tries to take added responsibili- 
ties on herself; likes to be alone; standing is above the 
average in school; her conscience bothers her more than 
other people seem to be bothered by theirs; wants to be 
engaged in tasks that require a great deal of ambition for 
their accomplishment; music, literature, and art afford her 
passive enjoyment rather than active accomplishment; 
plans and wishes are mainly concerned with finding some- 
one whom she can love whole-heartedly; plans and wishes 
are mainly concerned with finding someone who will love 
her with a deep, abiding devotion; dominant craving is to 
love; fear of doing wrong is stronger than her fear of losing 
love or not being loved; the conflict with conscience in her 
mind is more acute than in average individuals; is ready 
to take vigorous steps to overcome opposition; is ready to 
take vigorous steps to carry out her own plans; helps and 
supports other people more than she depends on them for 
help and support; is a vigorous type of personality, enter- 

ing the fight rather than seeking shelter; has entertained 
more than three plans of life seriously since her 15th year; 
wants to do something to contribute to God’s plan for the 
Universe; wants to do something of distinct value for the 
human race and the welfare of her own country. 


The eighth factor is interpreted as a “prepsychotic paranoid” 


type of maladjustment. 
The two girls with high loadings on the nin 
following configuration of traits in common: 


th factor have the 


has been very jealous of other per- 
sons; has had an intense dislike bordering on hatred for 
someone; life and actions guided at present by well-defined 
purpose; prefers tasks that are clearly outlined and easy to 
work at; prefers tasks in which she is very likely to succeed; 
music, literature, and art afford her passive enjoyment 
rather than active accomplishment; is dominantly inter- 
ested in things going on around her in the world in which 
she lives; plans and actions are mainly concerned with find- 


Likes to be alone; 
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ing someone whom she can love with all her heart; plans 
and wishes are mainly concerned with finding someone who 
will love her with deep, abiding devotion; dominant crav- 
ing is to love; is self-reliant rather than dependent on aid 
and comfort received from others; is ready to take vigorous 
steps to carry out her own plans; two or more persons have 
made her feel jealous; first made a serious decision about 
what she was going to do in life at sixteen or older; wants 
to do something to contribute to God’s plan for the Uni- 
verse; wants to equip herself for or improve herself in a 


professional career; is ready to take vigorous steps to over- 
come opposition. 


Jealousy and conscious hatred are the outstanding traits on 
the ninth factor and it is interpreted as a general schizoid, doublet 
factor indicating a “jealous” type of maladjustment. 


The following traits were common to the three girls with high 
loadings on the tenth factor: 


Has carried out to some significant degree her first 
seriously adopted plan of life; life and actions are guided 
at present by a well-defined purpose; wants to be engaged 
in tasks that require a great deal of ambition for their ac- 
complishment; prefers tasks in which she is very likely to 
succeed; prefers music, literature, and art to less aesthetic 
pleasures; if a person is widely admired, she will reserve her 
own judgment until she knows more about him; is domi- 
nantly interested in things going on around her in the 
world in which she lives; her fear of doing wrong is stronger 
than her fear of losing love or not being loved; is self- 
reliant rather than dependent on aid and comfort received 
from others; is ready to take vigorous steps to carry out 
her own plans; is ready to take vigorous steps to overcome 
opposition; helps and supports others more than she de- 
pends on them for help and support; wants to do something 
of distinct value for human beings and for the welfare of 


her own country; wants to do something to contribute to 
God's plan for the Universe. 


The tenth factor was tentatively identified as an “adjusted 
schizoid type of person.” 
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The following traits are held in common by the two girls with 
high loadings on the eleventh factor: 


Has a tired, worn-out attitude; has peculiar, abnormal 
fears; would give up and quit if things did not go her way; 
lacks self-confidence; is self-conscious; daydreams fre- 
quently; likes to be alone; has carried out to some signifi- 
cant degree her first seriously adopted plan of life; is more 
self-conscious than the average person; prefers music, liter- 
ature, and art to less aesthetic pleasures; plans and wishes 
are mainly concerned with finding someone whom she can 
love with all her heart; dominant craving is to love; her 
fear of doing wrong is stronger than her fear of losing love 
or not being loved; is self-reliant rather than depending 
on aid and comfort received from others; has a habitual 
tendency to keep aloof from almost everybody; wants to 
do something to contribute to God's plan for the Universe; 
wants to do something of distinct value for human beings 
and for the welfare of her own country 


The eleventh factor was identified as a withdrawal-from-reality 
type of maladjustment but without tendency to delusions. It 
indicates prepsychotic tendency toward simple schizophrenia or 
catatonia and the need for mental hygiene. 

It was noted that several traits appeared on almost all of the 
factors. Such nondiagnostic traits interfere with the differential 
interpretation of the factors and might well be eliminated if fur- 
ther refinement of the questionnaire were attempted. It was 
Noted that several of the doublet factors could be readily inter- 
preted, and the small number of variables on these factors was 
thought to indicate an infrequently occurring type rather than an 
Indeterminate factor. 

The results of the study were taken to indicate that “prepsy- 
chotic types” do exist in presumably normal groups (just as the 
Same syndromes may exist in both prepsychotic and psychotic 
individuals) and that personality structure should be considered 
continuous and multidimensional rather than made up of mu- 
tually exclusive qualitative types. 


186 Introduction to Factor Analysis 


P-TECHNIQUE IN THE STUDY OF PERSONALITY 


The conventional approach to intercorrelational 
studies has been to obtain measures of individual differences at a 
given point in time and to study their interrelationships. A varia- 
tion to this approach, introduced by Cattell (1946c), permits the 
study of the covariation of measures in a single person over a 
period of time. If a battery of tests is administered to a subject 
over and over again and his scores are correlated over the sample 
of occasions, the design is called P-technique (see p. 203). The 
relationship of factors obtained from this approach to factors ob- 
tained by the other techniques is, of course, of interest since, if 
similar factors should be obtained from intra-individual differ- 
ences as from inter-individual differences, they would help cor- 
roborate the existence and interpretations of the factors obtained 
from the more conventional designs. 

Tests which measure a changed function on re-administration 
(due to learning, etc.) are not suitable for this method, nor are 
very unreliable tests, because the variations in score from day to 
day might reflect chance fluctuation to a greater extent than 
changes in the function being measured. This approach should 
be of special interest to clinical psychologists, because it furnishes 
an objective method of studying the dynamics of an individual 
over a period of time. 

The pioneer study by this approach was done in the area of 
personality measurement (Cattell, Cattell, and Rhymer, 1947): 
A battery of “objective” psychological and physiological tests 
(listed in Table 10.12) was administered to a 29-year-old, normal 
woman on 55 weekdays. The sessions were held at various times 
between 8 a.m. and 10 p.m. to get a representative sampling of 
waking behavior and to determine the effect, if any, of fatigue on 
the test scores. 

The rotated loadings obtained from the intercorrelations of 


the objective tests are shown in Table 10.12. The factor loadings 


of two of the memory scores which were not included in the 


original analysis, because the subject reported she had used dif- 
ferent methods of memorizing at various times, were estimated 
from their correlations after the analysis had been made, when 
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TABLE 10.12. ROTATED OBLIQUE FACTORS IN OBJECTIVE 
PSYCHOLOGICAL AND PHYSIOLOGICAL MEASURES + 


Tests and Variables 


Time ofDoy s 2 eee eee eee eee ee 
Salivary pH (Alkilinity) .... 
Reversible Perspective, Rapidity . 
Myokinesis,Size of Lines Drawn 

Reaction Time ,Length of s... 
Reaction Time , Ratio of Reg. to Irreg.Warning 
Fluency of Association (T.A.T.) s.s... 
Perseveration ( Disposition Rigidity) 
Atoxic Sway Suggestibility . . 
Size,P.G.R. Deflection 


-18 ‘02 -03 -50 48 


‘02 -35 12 51 -47 
ao 602 02 25, -58 


27 -03 -48 01 -29 


Frequency, P.G.R. Deflection. .. « . 

Upward Drift, P.G.R. After Stress ... 2. 39 -=14 40 | .34 
Approximate Estimates Only 

Ratio Emotional to Unemotional Recall . 65 .40 .00  .00 

Efficiency Memory Recall s.s.s. 00 =45 00 .40 


1 From Cattell, Cattell, and Rhymer (1947, p. 284). 


it became apparent that their loadings would be consistent regard- 
less of the memorizing methods used. The four rotated, oblique 
factors were identified as follows: 

Factor I. Emotional Abundance vs. Emotional Dearth. 

Indicated by high loadings on sway suggestibility and size of 
Psycho-galvanic response. It is described as “emotional respon- 
Slveness to environment which might almost be called a dynamic 
Vigor or readiness, were it not that sway suggestibility has been 
Connected in the past with the more neurotic forms of emotion- 
ality.” 

Factor II. Physiological Ease vs. Emergency Alertness. 

Indicated by slow reaction time, salivary alkalinity, and slow 
reversible perspective. 
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Factor III. Fatigue vs. Energy Reserve. 

Scores with high loadings on this factor are for time of day, 
quickness of reversible perspective, frequency of perch galeanns 
response deflection, upward drift of psycho-galvanic response dur- 
ing relaxation after stress, and ratio of warned to unwarned re- 
action time. These scores are known to be related to fatigue and 
the factor is interpreted as “clearly one of diurnal fatigue.” 

Factor IV. Uncontrol vs. Inhibition. : 

Loaded on this factor are Scores on fluency of association, 
perseveration-rigidity (measured negatively), and myokinesis (large 
movements in drawing lines). Since these tests have been used 
as measures of “lack of integration and will” the factor is inter- 
preted as “some kind of lack of inhibition, with greater spon- 
taneity and carelessness,” 

At the beginning of each Session, the subject also took a ques- 
tionnaire, consisting of the two items loaded highest on each of 
the following ten factors isolated in previous personality studies, 
and was also rated on these factors by two close observers: 


l. Factor A. Cyclothyme ys, Schizothyme. (Rated principally 
on easy-going cooperativeness vs. obstructiveness.) 


2. Factor C. Emotional steadiness vs. general emotionality 
and neuroticism. 


3. Factor D. Hypersensitiy 
matic frustration tolerance. 
tention-getting behavior.) 

4. Factor E. Dominance ys, submissiveness. 

5. Factor F. Surgency vs, desurgency. (Rated principally on 
cheerfulness, placidity, talkativeness vs. worried, depressed.) 

6. Factor G. Positive character vs. immature dependence. 


(Rated principally on perseverance, persistence, and will quali- 
ties.) 


7. Factor H. Adventur 
thymia. (Rated principall 
shyness, withdrawal.) 

8. Factor I. Sensitive, anxious 
rigid, tough poise. (Rated 
reaction socially vs, poise.) 


€ sthenic emotionality vs. phleg- 
(Rated principally on excitable, at- 


ous cyclothymia vs. withdrawn schizo- 
: . . JS. 
y on friendly, outgoing behavior vs 


» imaginative emotionality V$- 
principally on jumpiness and over 


Applications in the Literature 189 


9. Factor J. Vigorous determined character vs. neurasthenia. 
10. Factor K. Intellectual, cultured mind ys. boorishness. 


(Rate principally on keenness of intellectual interests and analy- 
tical vigor of mind.) 


In order to determine the relationships between the factors 
derived from the “objective” tests and the factors represented in 
the ratings, a second analysis was done. The four “test” factors 
were represented by combining the scores of the two or three 
tests loaded highest on each (e.g., factor I was represented by Sug- 
Sestibility, P.G.R. Deflection, and half of P.G.R. rise). ‘The 
Yatings and self-ratings (which were derived from the question- 


naire) for each factor were combined when all three were posi- 
tively and Significantly correlated. This occurred in all but two 


of the factors: factor A for which the observer-rating and self- 
rating were treated as separate scores and factor K for which only 
the observers’ ratings were retained. In addition, there were two 
Scores obtained from the self-ratings only, self-sufficiency and day- 
dreaming-tendencies. All seventeen of these variables were cor- 
related over the sample of occasions and five factors were ex- 
tracted. Table 10.13 gives the rotated loadings of the variables 
as well as the cosines among the oblique factors. 

The factors were identified as follows: 

Factor I, Cyclothymia vs. Schizothymia. Variables 6, 12, and 
Ib have high loadings on the factor and it is interpreted as the 

more simple cyclo-schizo pattern. . . .” This factor agrees with 
actor A found in R-technique studies. 

Factor II. Surgency us. Desurgency (or Hysteria-Dysthemia). 

ariables 2, 10, 6, 15, 9, and 7 are loaded on this factor which 
agrees with factor F found in R-technique studies. 

Factor HI. Positive Character Integration vs. Immature, De- 
pendent Character. Variables 11, 16, 12, and 7 are loaded on this 
factor which is interpreted to agree with factor G of previous 

“technique studies. Variable 12 (friendly vs. shy) seems out of 
Place, but it is loaded on this factor to the same extent as in R- 


technique studies where it was referred to as “reserve” and “‘self- 
consciousness.’ 


” 
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PERSON- 
10.13. ROTATED OBLIQUE FACTORS IN 
ie ALITY RATINGS AND MEASURES? 


Variable 


Test Factor I, Emotional Abundance ü 


= 32 
2 | Test Factor I.Physiological Ease Coe ae -53 =03-.01 .06 
Emergency Alertness > ag 
3 | Test Factor I. Fotigue vs. Energy, . , eRe .02 07 306. .17 
-07| .44 
4 | Test Factor IY .Uncontrol vs Inhibition... . " -00 =06 -63 -07 
.48 
5 | Foctor A. Cyclothyme-Schiz. .. |, EY SO .02 .06 .24 .19 
(Inverse of Self-Rated Shyness, Slowness ) 69 
6 | Factor A. Cyclothyme-Schiz. E E re Nes are a +34 .23 -02 .37 
( Cooperative -Obstructive) z 70 
7 | Factor C.Stable Emotionally vs. Emotional , , ğ .37 .50 -.01 
77 
8 | Factor D. Self-Sufficient, Frustration Tolerant Š i 02 =07 .23 
62 
9 | Factor E. Self-Confident, Dominant st GEB Ses ee -.04 -02 .36 
vs. Submissive Mild Retiring 60 
10| Factor F. Surgent Vs Desing ong os ac g -.23 36 m0 
Cheerful , Content vs. Worrying „Anxious o7 | .87 
11 | Factor G.Persevering, Integrated, Se Roe eine cane 44 .06 = J 
Strong -Willed vs.Quiting , Fickle 78 
12| Factor H. Adventurous Cyclo-Schiz. . . . . +++. |.40 .05 =40 .04 .74 
(Friendly, Interested in People vs. Shy ,Caufious ) “ 
13 Factor 1. Jumpy, Easily -Embarrassed see eee. a25 0 07 .02 .24]- 
Over Active Imagination vs. Poised, Tough, Practical 4] .73 
14 | Factor J.Longuid ,Absent-Minded .. ce ee. . 1526 <07 =30 =25 -1 p 
Asthenic vs. Vigorous Orderly 46 
15| Daydreaming Tendencies PEE TE P .06 -47 .02|- 
8 
HGS i aE R e eae .42 .49 -06| -6 
47 
17 | Factor K. Intellectual, Analytical “et bve eee [205-06 .06 .06 226] - 
vs.Uninterested in Cultural Matters 


1 From Cattell, Cattell, and Rhymer (1947, P. 285). 
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Factor IV. Emotionally Stable Character vs. Demoralized 
General Emotionality. Variables 4, 7, 16, 15, and 10 are loaded 
on this factor which agrees with the D factor found in the R-tech- 
nique studies. It is described as “some sort of sobered, mature 
inhibition, as opposed to immature, frivolous waywardness.” 

Factor V. Adventurous Cyclothymia vs. Withdrawn Schizo- 
thymia. Variables 12, 1, 6, and 9 are loaded on the factor which 
agrees with factor H of the previous studies. In this P-technique 
study two cyclothyme-schizothyme factors were found as is also 
the case in the R-technique studies. The first one (factor I) is 
considered to be “the more general, generic one” whereas in this 
factor the “schizothyme pole is distinguished by withdrawal. . . . 
Also there is a lack of energy, adventurousness, and self-confi- 
dence.” 

It is of interest to find that the day-to-day variations in one 
person brought out a number of the same personality factors as 
were derived from inter-individual differences and lends further 
SUpport to their existence and interpretation. The intercorrela- 
tions among the oblique factors, with a few exceptions, agree well 
With those found in R-technique studies. As Cattell points out, 

the exact correlation among factors in one subject promises to 
furnish a new indication of personality uniqueness and one which 
may be of considerable diagnostic value.” 

The alignment of some of the test factors with factors based 
On ratings and questionnaires suggests a possible objective-test 
battery for some of the primary personality factors and helps to 
clarify the functional interpretation of these factors. 


CHAPTER II 


Some General 
Considerations 


SIMPLE AxEs AND PRIMARY AXES 


O. SOURCE of confusion in the use of oblique 


solutions has been that two types of coordinate systems have been 
used. They are illustrated below. 


Actually the two types of coordinates are even more different 
than they appear above, since they are taken with relation to dif- 
ferent sets of axes. Those on the left are called simple axes, and 
those on the right are called primary axes. Presenting the two 
sets of axes together as in Figure 11.1 will help to indicate their 
relationships. Two oblique planes 4 and B (or rather the traces 
of the intersections of the planes with the plane of the page) are 
shown. The normals (unit vectors perpendicular at the origin) 
to these planes are called the simple axes (V1, V2), and the loadings 
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called for by simple structure (vjz) are the perpendicular projec- 
tions of the test vectors on these reference axes. This type of load- 
ing has been used by Thurstone because of his interest in simple 
structure and his intention to pass planes (or hyperplanes) through 
clusters of test vectors in order to maximize the number of near- 
zero loadings. The emphasis is on the location of the planes, and 
the axes are placed orthogonal to the planes. If, however, the 
emphasis is placed on the axes, and they are passed through the 
clusters of points, another type 
of axis is obtained, referred to 
as the primary axes (Pi, Ps). 
This type of axis has been used 
by Holzinger and Harman. 
The first primary axis isin the faz 
Mace of the B-plane and the 
second primary axis is in the 
trace of the d-plane. From the 
diagram (Figure 11.1) it can be 
seen that there is a proportional 
relationship between the two n 
types of loadings. The values Ficure 11.1 The relationship of simple 
from One eat be commerted to and i ee ae to two 
que p 
the values of the other (Harris 
and Knoell, 1948), and hence the conclusions should be similar, 
Whichever set of axes is used. 

One other distinction should be made. Figure 11.2 indicates 
the relationships of the axes to three oblique planes. Simple axis 
V, is the normal to plane A, etc. Primary axis P, is at the inter- 
Section of planes B and C. The projection of test j on Va can be 
obtained by dropping a perpendicular to plane B. To obtain the 
Projection of test j on axis P», pass a plane through point j parallel 
to plane B. The length of Ps, that lies between plane B and the 
plane parallel to it, is the projection of test j on Pe (aj). The less 
oblique the planes, the closer together are any pair of simple and 
primary axes (e.g., P2 and Vo), and in the orthogonal case the two 
Sets of axes coincide. The loadings on the simple axes are perpen- 
dicular projections, whereas the loadings on the primary axes are 
oblique projections. Perpendicular projections represent correla- 


v 


Troce of 


=v 
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tion with an axis and are referred to as structure values. Oblique 
projections represent the coordinates of the points on Cartesian 
axes and are referred to as pattern values. Since pattern values 
are coordinates rather than correlations, they may be greater than 
1.0. Obviously, both types of values can be obtained on either 
primary or simple axes, and Holzinger and Harman (1941) usually 


~, s 
y? A 


V3 


FIGURE 11.2 The relationship of simple and primary axes to three oblique planes. 
report both sets of values for the primary axes. It is of interest 
to note that it is the obli 


que projections (pattern values) on the 
primary axes that are the loadings (ax) needed for the linear 


€quations which describe a person’s standard score as a weighted 
combination of his factor scores, 


OBLIQUE ys, ORTHOGONAL SOLUTIONS 


The location of the reference axes is in the nature 


investigator. One source of disagreement has been the issue of 
whether the reference axes (and hyperplanes) should be restricted 
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arbitrarily to orthogonality or whether the angles among them 
should be allowed to vary from analysis to analysis as indicated 
by the data. Some of the arguments advanced by those partial 
to orthogonal axes include: 


Independence. Orthogonal axes are uncorrelated and repre- 
sent theoretically independent factors. If the practical measures 
(tests, etc.) are not as independent as are the theoretical factors, 
this may be an indication that independent measures have not yet 
been developed rather than that the factors are interdependent. 
Such an interpretation would represent a challenge to construct 
new measures, based on the interpretations of the factors, that 
would vary independently. Not until it has been demonstrated 
that such independent measures cannot be constructed for a given 
Population should one resort to correlated factors. 


Simplicity, Orthogonal factors are much simpler to work with 
computationally and graphically. Graph paper, of course, usually 
1s ruled orthogonally, and orthogonal axes can be represented on 
it accurately, If orthogonal graph paper is used to represent 
oblique axes, as is sometimes done, allowance must be made for 
distortion in the representation. Computationally, likewise, or- 
thogonal axes are simpler since the identity matrix represents 
the angular separation of the orthogonal factor vectors and does 
not have to enter into the reproduction of the correlation matrix 
from the factor matrix. Also the calculation of an inverse is 
avoided when dealing with orthogonal factors since the inverse 
of an identity matrix is itself an identity matrix. Furthermore, 
there are a number of distinctions such as simple axes and primary 
axes, pattern and structure, which are needed for oblique axes 

Ut are unnecessary for orthogonal axes since the distinctions dis- 
appear in the orthogonal case. 


Instability of Angular Separations. To some extent at least 
the Obliqueness of the axes in any one study is determined by 
sampling instability and the angular separations can be expected 
to fluctuate from sample to sample. 


Similarity of Results. Except for instances of highly correlated 
factors, there is little difference in the interpretation of the factors 
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derived from the two approaches and the added theoretical and 
computational complications of oblique factors make little prac- 
tical difference. There is also the problem of how oblique the axes 
should be permitted to be before they no longer can be considered 
independent, but blend into one factor. 


The following are a few of the advantages claimed by those 
partial to oblique axes: 


Important Information. The amount of obliqueness is itself 
important information. If factors indicated by the data are cor- 
related (dependent), this should be taken into account in interpret- 


ing the results. Also second and higher-order analyses can be made 
of the intercorrelations of the factors. 


Better Fit of Axes. If the hyperplanes are passed exactly 
through the clusters of test vectors representing the sample data, 
a more compelling simple structure can be obtained. This might 
be simple structure for one analysis rather than simultaneous sim- 
ple structure over a series of analyses (see Chap. 8), however, since 
sampling fluctuations will determine, in part, the location of the 
oblique axes. It is true that there is some “play” in the location 


of orthogonal axes when the test vectors themselves are not orthog- 
onal, which might lead to a les 


s unique rotational solution for a 
specific matrix. 


Closer Duplication of Nature. A number of measures, as they 
commonly appear in nature, tend to be related. A commonly cited 
example is the height and weight of persons. Even though inde- 


pendent measures of these traits could be developed the correlated 
measures of height and weight are useful, 


FREQUENTLY IDENTIFIED Facrors 

Factors are hypothetical constructs or intervening 
variables which can be related both to the stimulus (test items: 
situations, etc.) and to the response (answers or behavior of exam- 
inees). If a factor can be recognized and identified in a wide 
variety of situations and conditions, 
functional unity. There are probab 
sient” factors which have a temporary 


Best g a 

It is thought to represent 
gran 

ly a large number of “tra! 
i : se 

or limited existence becau 
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they represent a temporary or local influence. Thus in the Air 
Force studies during World War II a pilot-interest factor was 
found in several analyses (Guilford, 1947b). During that period 
this was an important, stable dimension for describing the be- 
havior of the aviation-cadet population. Such limited factors 
probably can be obtained from any group subjected to uniform 
influences, training, and background. Most of the factors in 
Which we are interested have a broader scope. They have a more 
common basis such as schooling, widespread cultural influences, 
and possibly genetic background. Factors, then, are statistically 
derived unities, and their interpretation and generalization follow 
the accepted scientific methods for interpreting and generalizing 
data. The factor content of a test should be invariant for a given 
Population when analyzed in successive batteries having the same 
Sets of factors. As selection of the population varies with age, 
training, and other background variables, shifts in the factor 
loadings may be expected. 

__ A number of aptitude and achievement factors have been iden- 
tified in an adequate number of studies over sufficiently broad 
Populations to lend some confidence in their stability and wide 
Scope. Such a list of factors based largely on adolescent and adult 
Populations is given by French (1952, p. 12) as: 


Established Tentative 
Aiming (eye-hand coordination) Age 
Attention? Ambidexterity 


Deduction (“general reasoning”)? 
Finger dexterity 
Flexibility of closure? 
Fluency of expression 
Grades? 
Induction? 
Ideationa] fluency? 
udgment 
€ngth estimation 
Ssociative (rote) memory 
anual dexterity 


Mechanical experience 
Number 


Auditory resistance 

Articulation 

Carefulness 

Culture 

Eduction of perceptual relations 
Eduction of conceptual relations 
Eduction of conceptual patterns 
Eduction of correlates 

Figure illusions 

Integration 

Logical reasoning 

Loudness 

Mathematics background 
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Established 
Perceptual speed 
Psychomotor coordination 
Pilot interest 
Schooling 
Social studies 
Space 
Spatial orientation 
Speed? 

Speed of closure 
Verbal comprehension 


Introduction to Factor Analysis 


Tentative 
Musical memory 
Music 
Naming 


Perceptual alternations 
Pitch quality 

Planning 

Public speaking 
Reaction time 

Sex 

Speed of judgment 


Visualization 
Visual memory 
Word fluency 


Span memory 
Symbol substitution 
Tapping 


After reviewing the literature of factor analysis, French (1951) 
listed as “established” those factors which had been found with 
reasonable clarity in at least three different analyses. The ques 
tion marks placed after some of the established factors indicate 
that the factors could be expected to appear with some confidence 
if suitable tests were included in the battery, but their psycho- 
logical interpretation or scope is not well determined. If a factor 
was interpreted with reasonable clarity in only one or two analyses 
it was listed as tentative. Descriptions of the factors and tests 
which define them are given in French (1951). 

Results in the area of personality have been even less con- 
clusive. Several tentative lists of personality factors have been 
compiled. After reviewing the evidence from three types of pet 
sonality measures (behavior ratings, self-inventories, and objective 
tests) Cattell (1946c) considers the following twelve bipolar factors, 
which are indicated by their extremes, to be the most general: 


1. Cyclothymia vs. schizothymia. 


2. Intelligence, general mental capacity vs. mental defect. 


3. Emotionally mature stable character vs. demoralized ge” 
eral emotionality. 


4. Dominance-ascendance vs. submissiveness. 
5. Surgency vs. agitated melancholic desurgency. 
6. Sensitive, anxious emotionality vs. rigid tough poise. 


aed 
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7. Trained, socialized, cultured mind vs. boorishness. 

8. Positive character integration vs. immature dependent char- 
acter. 

9. Charitable, adventurous cyclothymia vs. obstructive, with- 
drawn schizothymia. 

10. Neurasthenia vs. vigorous, “obsessional determined” char- 
acter. 

11. Hypersensitive infantile sthenic emotionality vs. phleg- 
matic frustration tolerance. 

12. Surgent cyclothymia vs. paranoia. 


Considerable modification of both of these lists of factors can 
be expected as a result of further investigations.* 

A group of factor analysts who met at the Educational Testing 
Service in Princeton, New Jersey, in November, 1951, is engaged 
in the task of clarifying some of the more common aptitude 
factors and recommending several standard reference tests for 
each factor (French, 1952).2 The Educational Testing Service 
intends to serve as a repository for these tests and make copies 
available to persons who desire to use them for research purposes. 
Similar batteries of reference tests of personality and achievement 
factors are to be developed in the future. 


INFLUENCES oN FACTOR LOADINGS 

A number of conditions that influence the outcome 
of a factor analysis must be taken into account in the setting up 
and interpreting the results of a factorial study. Some of these 
considerations are the selection of the sample, the type of correla- 
tion coefficient used, the conditions of administration, and the 
Scoring formulas applied to the tests. 


Selection of the Sample. The major considerations in the 


selection of the sample are the range of the group on the traits 
ny 

_ lSince the above was written, French (1953a) has issued an extensive compila- 
tion of factorial studies and results in the area of personality measurements. 

_ 2 Tests of this type which are used to establish the presence of a factor in a 
given population have been variously referred to as reference tests, marker tests, 
tie-in variables, anchor tests, etc. 
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measured by the tests, and the degree of selection on traits cor- 
related with the factors. If the purpose of the study is merely 
the identification of factors in a new area of investigation, the 
wider the range the better. If, however, the results are to be 
generalized to a carefully defined population, the sample must 
be representative of that population. Since factor loadings (pat- 
tern values) are average weights, putting scores from unlike pop- 
ulations (e.g., men and women on strength tests or mechanical 
tests) together will result in loadings which are averages for the 
groups which have been combined (Jeffress, 1948a). 


Background, Training, Experience, and Set. Selective back- 
ground population characteristics, such as age, sex, education, and 
training, may influence factor content. As has been pointed out 
by Thurstone (1947c), “factors cannot be expected to be invariant 
from one population to a different population.” A vocabulary 
test of moderate difficulty may be a measure of reasoning for ele- 
mentary school children but a measure of perceptual speed for 
college students. Items at different difficulty levels in an appar- 
ently homogenous test may emerge as measures of different func- 


tions even though methods of correlation that minimize the in- 
fluence of difficulty level are used. 


Even more vexatious to the test constructor and factor ana- 
lyst is the problem that different examinees may employ different 
mental processes in solving an item. In solving a spatial problem, 
for example, some may rely on verbal and reasoning processes, 
while others may use visual imagery. Such ambiguity can prob- 
ably be minimized by trying to find tasks that most persons solve 
in the same manner and by carefully selecting populations so that 
such differences are held to a minimum. 

The processes by which problems are solved can be controlled 
somewhat by means of a set given either in the directions or in 4 
training period during which a given routine for solving the prob- 
lems is taught. If the members of a group are at different stages 
on the learning curve of the traits being tested, they may us? 
different processes to do a task. Thus, in learning telegraphy» 
typewriting, or reading, quite different factors are measured bY 
the task at different points along the learning curve. speeded 
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scores, also, have quite different meanings for different ages, diffi- 
culty levels, and stages of learning. 


Type of Correlation Coefficient. The most satisfactory type 
of correlation coefficients for continuous variables are product- 
moment correlations. The most consistent results are obtained 
when distributions have been normalized, but satisfactory results 
are also obtained from the correlations of the raw scores if the 
assumptions (e.g., rectilinearity) for the correlation method used 
are satisfied. Biserial and rank-difference correlations may be 
used as estimates of the product-moment correlation coefficient, 
although it is probably best to use only one type of correlation 
coefficient in a study. For large samples tetrachoric coefficients 
may be used if the splits on the variables are not too extreme and 
the assumption of rectilinearity can be justified. 

“The phi coefficient (fourfold point correlation) should not be 
used unless some correction is made to avoid spurious factors 


due to differences in splits of the dichotomized variables. 


Experimental and Linear Dependence. It has been pointed + 
out by Thurstone (1947c), Guilford (1952a), and others, that scores 
that are experimentally dependent should not be included in a 
Correlation matrix for factor analysis because of the spurious ele- 
ment introduced by the correlation of chance and error variances. 
Examples of such experimentally dependent scores are speed and 
accuracy scores taken from a single performance, rights and wrongs 
Scores, scores derived from overlapping keys as in the case in some 
interest and personality inventories. If several scores are derived 
from a single performance as is the case with a number of the 
Rorschach indices, they would not be experimentally independent 
scores, 

Spurious correlations may also be due to linearly dependent 
Scores such as correlating a total score with a part score which 
entered into the total score or the correlation of the ratio or dif- 
ference between two scores with one of the scores which entered 
into the ratio or difference. 


Time Limit and Scoring Formula. The extent to which a 
test is speeded may have some influence on its factor content, 
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because the correlation between speed and level (power) scores 
may vary anywhere from high positive to negative. General speed 
factors have been found in several batteries of speeded tests. Also, 
the more speeded a test is, the more likely the rights and wrongs 
scores are to be independent (for “power” tests the correlation 
between rights and wrongs is —1.0). Relatively independent 
rights and wrongs scores may differ in their factor content, and 
the factor composition of a total score which is a combination 
of such independent rights and wrongs scores will vary depend- 
ing on the scoring formula used (Fruchter, 1953). The reliabili- 
ties of the scores also have an effect on the size of the factor load- 
ings and formulas for estimating the changes in loadings, because 


changes in reliability have been presented by Guilford and 
Michael (1950). 


EXPERIMENTAL DESIGNS 


Cattell (1946c and 1952a and b) has pointed out 
six possibilities for studying covariation in tests, persons, and 
occasions and has designated them O- and P-technique, Q- and 
R-technique, and S- and T-technique. These designations refer 
to the experimental designs by which the intercorrelations are 
obtained and each technique in a pair is the transpose of the other. 


Q- and R-techniques. We will begin with this pair because 
R-technique has been used in the vast majority of studies to the 
present and is the most familiar. R-technique is the well-known 
correlation of tests administered to a sample of persons on a single 
occasion. It indicates the extent to which the tests covary over 
a series of persons on one occasion or under one condition. Q- 
technique, the transpose of R-technique, indicates how two oT 
more persons covary over a series of tests administered on one 
occasion. Actually the term test is somewhat misleading here 
because, in correlating persons, the scores of an individual on 
all tests must be averaged (just as in correlating tests, the scores 
of all persons on the test must be averaged). Now, if the scores 
of the tests are expressed in different units (e.g., so many units of 


reading comprehension, so many units of perceptual speed, and 


sO many units or reaction time, etc.), it will not make sense tO 
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average them. Therefore Q-technique has found its greatest use- 
fulness where some kind of a “person continuum” can be as- 
sumed, and the various “tests” (really items) are placed on this 
continuum. Examples are esthetic and other types of preference, 
and attitude and personality questionnaires where the subject can 
tell to what extent he likes or dislikes an item or the extent to 
which an item describes him. The various “scores” are, conse- 
quently, in terms of the same units (e.g., the person’s preference) 
and can be averaged. This type of measurement has been termed 
ipsative (relative to the self) as contrasted with the usual type of 
measurement, which is normative (relative to the group). If two 
persons answer a series of questions or rank a series of stimuli 
similarly, they will be highly correlated. The factors derived 
from Q-technique (obverse factor analysis) have persons loaded 
on them and each factor represents a hypothetical person of a 
given “type.” (For examples of obverse factor analysis, see Chap- 
ter 10.) 


O- and P-techniques. Of these two, the more common is 
P-technique. It consists of correlating a group of tests adminis- 
tered over a series of occasions to one person. By this design it 
should be possible to determine whether, where a battery of tests 
is administered repeatedly to one person, any of the tests covary 
In time. Tests which measure the same factors in an individual 
should vary as that factor in the individual fluctuates with time. 
This approach is especially useful with dynamic variables such 
as Personality traits that do fluctuate with time. It could also 
serve to help identify and confirm the factors obtained from R- 
technique if the same factors were found in a battery of tests, 
Used to obtain intra-individual differences, as were found in a 
battery of tests used to obtain inter-individual differences. It 
should make possible a description of the dynamics of an indi- 
vidual in terms of relatively objective factors as well as the com- 
Parison of the factors involved in the dynamics of two or more 
Individuals, (See Chapter 10 for an example of the application 
of P-technique.) 

O-technique, involving the correlation of occasions for a series 
Of tests for one individual, is the transpose of P-technique. It 
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indicates the extent to which two or more occasions covary over 
a series of tests for one person. By this type of analysis it should 
be possible to group occasions by the ways in which they influence 
performance. It has been observed that the same person may 
behave quite differently on different occasions; he may be inde- 
pendent or dependent, shy or not shy, moral or immoral under 
varying circumstances as his role changes. It should be possible 
to determine whether different occasions or conditions systematl- 
cally affect the performance of a person. If the occasions have 
been varied in some systematic way the interpretations of the 
factors should be more meaningful. Since this method (as was 


” 
also true of Q-technique) requires summing over a series of “tests, 
the scores must be in terms of similar units. 


S- and T-techniques. S-technique consists of determining the 
extent to which two or more individuals vary together over a 


series of occasions or trials on one task. It should be useful for 


determining groups homogenous with regard to a given trait over 
a series of occasions. 


Traits which vary in the individual on dif- 
ferent trials or occasions, as in learning studies or in different 
social situations, would be most appropriate for this type of analysis 
also. Varying the occasions in a systematic or controlled manner 
would probably lead to the most meaningful “person” factors. 

T-technique, the transpose of S-technique, indicates how two 
or more occasions covary over a series of individuals on one test. 
The most familiar example of this is, of course, test-retest relia- 
bility. If the test is of a dynamic trait that fluctuates consider- 
ably under differing circumstances, this approach might be useful 
for determining the basic types of occasion factors affecting per- 
formance on the test. O-, S-, and T-techniques have been out- 
lined as theoretical possibilities. Few if any applications have been 
made of them up to the present time. (However, see the study 


by Wherry reported in Chapter 10 as a possible application of 
T-technique.) 


THE ESTIMATION of FACTOR SCORES 


A score derived from a complex test, which 1s 4 
measure of several factors, may be ambiguous in that we do not 


3 Mower (1953b) refers to these as M- and N-techniques, respectively. 
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know what combination of the factors may be represented by any 
individual's score. A score derived from a “pure” test is rela- 
tively unambiguous, since it represents a single trait or functional 
unity. It is said to be “univocal.” Factor scores may be com- 
puted in several ways, the fundamental principle being to obtain 
the weighted combination of the tests, by multiple correlation, 
that best predict a factor. For this purpose the loadings (structure 
values) of the tests or variables on the factors are used as validity 
coefficients. 

Of course, if a pure, reliable measure with a satisfactorily high 
Saturation on a factor exists, the scores on it can be used as the 
factor scores. If a pure measure of a factor is not available, but 
there are several tests with high loadings on the factor and sec- 
ondary loadings on nonoverlapping factors, either the sum or a 
weighted combination of their scores may be uesd as an estimate 
of the factor scores. Guilford and Michael (1948) have presented 
convenient equations for obtaining univocal factor scores, where 
the suppression of one or more secondary factors is desired. ‘Their 
€quations also make it possible to maximize the variance on the 
desired factor, or minimize the variance on the undesired factors, 
or obtain any given ratio of the two types of variance. 


MECHANICAL As To COMPUTATION 

i One practical limitation to the widespread applica- 
tion of the factorial methods has been the large amount of labo- 
rious computation involved. Approximate methods, such as are 
provided by the centroid solution, have been used to reduce the 
labor within reasonable bounds. The present and future de- 
velopment of high-speed computers should do much to cut the 
amount of time and labor required for a factorial solution and 
make feasible the use of more powerful statistical techniques, such 
as the maximum-likelihood method (Lawley, 1943a, 1950 and 
1951), which provide mathematically efficient estimates of factor 
loadings and suitable tests of significance. 

The major computational problems in factor analysis are the 
extraction of factors, the plotting and rotation of axes, the multi- 
plication of the factor matrix by the transformation matrix, and 
the calculation of inverse matrices. 

A method for extracting principal axes by means of IBM 
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equipment has been devised by Tucker (1946). The se 
setup that is desirable for this method includes a key punch a 5 
verifier, a sorter, an alphabetical accounting machine, and a Ta 
mary punch. Tucker (1942) has also devised a ee 
method for centroid factors, and Hall, Welker, and Craw eT 
(1945) have made an adaptation of this method to more common'y 
available machines and have also introduced some labor saving 
steps. , 

Tie Watson Scientific Computing Laboratory at Columbia 
University has an IBM sequence-controlled electronic calculator, 
which can extract principal-axis or maximum-likelihood a 
rapidly and furnish a running chi-square test to test the signifi- 
cance of the extracted factors. 

For the plotting of factors the General Electric Company has 


developed, for the Adjutant General's Office, a device that plots 


‘ ; . tes 
points for two axes at a time on an oscilloscope screen, rota 


electronically, and does some of the matrix-multiplication ee 
tions involved. A method for matrix multiplication by means © 
a special attachment to an IBM tabulator has been devised by 
Tucker (1940b). The only copy of this attachment, however; 8 
at the University of Chicago. The steps in matrix multiplication 
on electronic calculators have been outlined by Brannan (1952) 
and an application of electronic computers to factor analysis re 
been made by Wrigley and Neuhaus (1952). Hotelling (194 : 
has stated some of the problems that need to be solved to dé 
velop more efficient methods for 

In general the com 
pensive, 


computing inverses. . 
puting devices are cumbersome and oe 
and they are efficient only for large problems and ha 
laboratories where several analyses are being done simultaneous i 
The time involved in setting up the equipment for several eo 
yses is usually about the same as is required to set up one analysis» 


and the time needed to run the data through the machines ui 
usually negligible by comparison, 


NEXT STEPS IN FACTOR ÅNALYSIS 


. Cc 
Considerable theoretical development and oe 
m of factor analysis have occurred in the past pe 
owever, still remains to be done in both of th 


tical applicatio: 
ade. Much, h 
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areas. In the first place, better planned studies are needed. 
Applying factor analysis to a table of intercorrelations just be- 
cause it happens to be available cannot be expected to yield mean- 
ingful results. The test battery should be carefully constructed 
or assembled with specific hypotheses in mind, and all aspects 
of the area under investigation should be adequately covered (in 
more technical language “‘overdetermined”) to make possible clear 
interpretations of the factors obtained. 

A great deal of work remains to be done on the interpretation 
of even the better-known factors. It has been suggested, for in- 
stance, that the essence of the numerical-facility factor is not 
number, but the ability to deal with a highly practiced closed 
system of associations. Where an interpretation of a factor is 
made, or there are alternative hypotheses, new tests which call 
for the hypothesized behavior should be constructed and analyzed 
in order to verify or reject the interpretations. The nature and 
extent of the factors should be studied by carefully controlled, 
nonfactorial methods to determine how they vary under different 
conditions and with different populations. The following are 
Some questions that need to be answered in this connection: Is 
the function represented by a factor amenable to training or in- 
fluenced by set or background? Are some factorially complex tests 
complex only because different individuals in the population do 
the task differently? If so, can the methods by which individuals 
solve problems be ascertained, and can the population be sub- 
divided into more homogenous groups or made more homogenous 
by training or set? 

Very few investigations of how factors change with age, sex, 
Speeding, difficulty level, etc., have been done. 

In connection with better designed factorial experiments, im- 
Proved theoretical models, in hyperspace, are needed for the 
rotational problem, so that criteria for unique solutions can be 
developed, Tucker (1951) has developed methods for rotating 
Several batteries to “congruent” solutions for those factors that 
have overlapping variables and has also developed the factor 
analysis of covariances so that differences in group variability may 
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be taken into account. Burt (1952) has suggested an approximate 
formula for the standard error of a factor loading under certain 
restricted conditions. 

The factorial methods need to be extended to nonmetric data 
also. Steps in that direction have been taken by Lazarsfeld (1950) 
and Burt (1950b). 

With regard to application, many more analyses which include 
independent criterion measures need to be done, both to help 
clarify the nature of the criteria and to help determine their rela- 
tionship to factorially-defined predictive measures. Additional 
studies need to be made using P- and S-techniques, so that the 
dynamics of individuals and groups over a series of trials or occa- 
sions can be determined. O- and T-techniques should be applied 
more extensively so that trials or occasions can be classified accord- 
ing to the common factors operating in them for one person on 4 
number of traits, and for single traits on a number of persons. 
With more rapid methods of computation available, it is expected 
that factor analysis will be applied more frequently as a method of 
item analysis in the construction of homogenous tests. 

With the increasingly insightful and accurate application of 
factorial methods not only will new understandings and control 
of social behavior be achieved, but applications will also be made 
to the many other sciences where the basic variables underly- 


ing complex multivariate interrelationships must be isolated and 
identified. 


+ The following are some analytical methods for rotation of axes that have been 
published recently: 


J. B. Carroll. An analytical solution for approximating simple structure in factor 
analysis. Psychometrika, 1958, 18, 23-38. 

J. O. Neuhaus and C. Wrigley. The quadrimax method: an analytical approach 
to orthogonal simple structure. Memorandum Report A-6, University of Illino!s, 
Contract No. AF 33(038)-25726, August 1953, 33 pp. (mimeographed). ii 

D. R. Saunders. An analytical method for rotation to orthogonal simple st™™ 
ture. Educational Testing Service Research Bulletin 53-10, August 1953, 28 PP: 

L. L. Thurstone. Analytical method for simple structure. Psychometric Labora 
tory, University of North Carolina, Report No. 6, July 1953, 14 pp- 
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CALCULATION OF THE ESTIMATED FACTOR LOAD- 
INGS OF A VARIABLE NOT INCLUDED IN THE 
ORIGINAL ANALYSIS + 


i, ir is desired to estimate the loadings of a 
variable, relative to the factors in a battery in which it was not 
included, and the correlations of the variable with the other tests 
in the battery are known, it may be done by the following steps: 


___ 1. Place the rotated factor loadings of the original test battery 
in a matrix with the tests running vertically and the factors hori- 
zontally. Assume an example with four tests and three factors. 


Rototed Factor Matrix 


Factors 


_—_ 


1 This method was first presented by Dwyer (1937). An extension of the method 
to the simultaneous estimation of the loadings of several variables not included 
in the original analysis was given by Mosier (1938) in matrix notation as follows: 


Vip = R'E mG ma Tmp 


Where V1, = factor loadings of new tests on the rotated factors; 
R’; = transpose of the matrix of intercorrelations of new tests with tests already 
factor analyzed; 
Pin = transpose of factor loading: 
GT nat = Eni ims 
Tmp = transformation matrix from unro' 
209 


s of analyzed tests on the unrotated factors; 


tated to rotated loadings. 


DI: Appendix 


2. From the above matrix, a factor-estimation matrix may be 
formed by the following steps: 


(a) In the factor estimation matrix shown below, the diagonal 
values, cells (la, 1), (2a, 2), (3a, 3), etc., are found by summing 
the squares of the values in each factor column (f°). Retain 
all values to twice as many decimal places as they were given in 
the rotated factor matrix. 


Rotated Factor Matrix Factor Estimation Matrix 


(b) If there are four factors in the rotated factor matrix, the 
factor estimation matrix is a four-by-four matrix. Similarly, for 
n factors an n X n matrix would be obtained. 

(c) The value in cell (la, 2) is the algebraic sum of the prod- 
ucts of the factor loadings in factor column I times the corre- 
sponding loadings in column II (SF; Fy). 


Rotated Factor Motrix Factor Estimation Matrix 


factor loadings in column III (SF, Fin). 


Rotated Factor Matrix Factor Estimation Matrix 


I pig 
a 8 
33 25 
wat -è 
Kd od 


EF; Fy =.78 


—— 
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(e) In a four-factor problem, cell (la, 4) would be the algebraic 
sum of the products of the factor loadings in column I times the 
corresponding values in column IV. For larger problems, the 
additional values in row la are obtained by a similar process of 
cumulative summing of cross-products. 

(£) The value in cell (2a, 1) is found by the same procedure 
as was used for obtaining the value in cell (la, 2) and may be used 
as a check on cell (la, 2). (See step 2c.) 

(g) The value in cell (2a, 3) is the algebraic sum of the prod- 
ucts of the factor loadings in column II times the corresponding 
factor loadings in column III (SFr Fim): 

(h) The values in cells (3a, 1) and (3a, 2) are obtained in the 
same way as those in cells (la, 3) and (2a, 3) respectively, and may 
be used as a check on those values. 


Rototed Factor Matrix Factor Estimation Matrix 


(i) For larger problems, the values in the remainder of the 
factor estimation matrix can be found by extending the procedure 
outlined above. 


3. The next column to be added on the right side of the factor 
estimation matrix is the validities column, which is headed 
Valid.” in the example. This is obtained by the following steps: 


(a) List to the right of the rotated factor matrix the intercor- 
relations of the test (£) whose factor content is to be estimated 
With the tests of known factor content. (For the example, assume 
Tit = .7, Ta = A, Tst = 35 Ta = 1.) 


Rotated Factor Matrix 
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(b) The value in cell (la, Valid.) of the factor estimation 
matrix is the algebraic sum of the products of the loadings in 
column I times the corresponding correlation coefficients in 
column ż (Ft). 


Rotated Factor Matrix Factor Estimation Matrix 


(c) The value in cell (2a, Valid.) is found in like manner, using 
column II and column ¢ (=F). 

(d) The value in cell (3a, Valid.) is found as above, using 
column III and column ¢ (Fit). 

(e) For problems with more than three factors, the remainder 
of the values in the “Validities” column are obtained in a similar 
manner. 


4. The next column in the factor estimation matrix is a 
“check” column, and the values in it are obtained by summing 
the values in each row, including the validities column, regarding 


algebraic signs. The use of the check column will be explained 
in step 5e. 


Factor Estimation Matrix 


-62 |3.07 ( 1.47+4.20+.78 
Ba +.62=3.07, etc) 


5. The factor estimation matrix is now completed. Solution 
for the factor loadings is somewhat analogous to operations on a 


multiple-correlation matrix in computing beta weights by the 
Cowles-Crout method.? 


2See J. T. Cowles, “A labor-saving method of computing multiple correlation 
coefficients, regression weights, and standard errors of regression weights,” Tech- 
nical Memorandum No. 2 (Lackland AFB: Psychological Research and Examining 
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(a) Obtain cell (1b, 2) in the factor estimation matrix by divid- 


ing cell (la, 2) by cell (la, 1). It is suggested that the b-row entries 
be made with colored pencil. 


1b, 2 20 14 
( a) =e 


(b) Obtain cell (1b, 3) by dividing cell (la, 3) by cell (la, 1). 


-78 
1b, 3) = — = .5 
(1b, 3) 1.47 9 


Factor Estimation Matrix 


(c) In a four-factor problem, cell (1b, 4) would be obtained 
by dividing the value in cell (la, 4) by the value in cell (la, 1). 
For larger problems, the additional cells in the first row are ob- 
tained in a similar manner, i.e., the value in cell (lb, n) would 
be obtained by dividing the value in cell (la, n) by the value in 
cell (la, 1). 

(d) The values in cells (1b, Valid.) and (1b, Check) are found 
in like manner. l 

(€) For any row the values in the b cells of the Check column 
should equal 1.00 plus the sum of all values in the b row to the 
right of the diagonal value, including the entry in the validities 
column; this sum is entered to the right of the check values of the 
b rows in a column headed “Row Sum + 1.0” (see matrix in step 
5s, on page 216). In the example, the first value in this column is: 


1.00 + .14 + .53 + .42 = 2.09 


— 


Unit, 17 Feb. 48), and G. H. Dash, “A simplified procedure for computing multiple- 
correlation coefficients and regression weights by the Cowles-Crout technique,” 
Technical Memorandum No. 12 (Lackland AFB: 3309th Research and Development 
Squadron, 25 Feb. 49). 
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(£) Next, obtain the value in cell (2b, 2) by subtracting from 
the value in cell (2a, 2) the product of cell (2a, 1) and cell (1b, 2). 


(2b, 2) = .23 — (,20)(.14) = .20 


(g) Next, obtain the value in cell (3b, 2) by subtracting from 
the value in cell (3a, 2) the product of cell (3a, 1) and cell (1b, 2). 


(3b, 2) = -13 — (.78)(.14) = .02 


(h) For a four-factor problem, the value in cell (4b, 2) would 
be obtained by subtracting from the value in cell (4a, 2) the prod- 
uct of (4a, 1) and (lb, 2). For larger problems, values in the 
additional cells of column 2 would then be determined in a cor- 
responding manner, i.e., subtracting from the value in cell (na, 2) 
the product of cell (na, 1) and cell (1b, 2). 

(i) The value in cell (2b, 3) is obtained by dividing the value 
in cell (3b, 2) by the value in cell (2b, 2). 


.02 
.20 

(j) For a four-factor problem, the value in cell (2b, 4) is ob- 
tained by dividing the value in cell (4b, 2) by the value in cell 
(2b, 2). For larger problems, the rest of the values in row 2 would 
be obtained by dividing the entry in cell (nb, 2) by the value in 
cell (2b, 2) to obtain (2b, n). 

(k) The value in cell (2b, Valid.) is obtained by subtracting 
from cell (2a, Valid.) the product of cell (2a, 1) and cell (1b, Valid.) 
and dividing by the value in cell (2b, 2).* 


(2b, 3) = — = 10 


3It should be noted here that consistency in rounding off is necessary if the 
work is to check. For example, in the computation in paragraph 5k, rounding to 
two decimal places consistently would give: 
.08 — (.20)(.42) _ .08 — .08 
-20 -20 


The use of three decimal places in one of the operations would give: 
.08 — (.20)(42) _ .08 — .084 
-20 20 


When cell (26, Valid.) has the value of .00, the computations will check. When it 
is given the value —.02, the computations will not check, and the error introduced 
will affect all further work. 


-00 


—.02 
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. .08 — (.20)(.42 
(2b, Valid.) oe .00 
.20 

(1) The value in cell (2b, Check) is obtained by subtracting 
from cell (2a, Check) the product of cell (2a, 1) and cell (1b, Check) 
and dividing by the value in cell (2b, 2). 
-64 — (.20)(2.09) 

.20 

(m) The result in (2b, Check) should be equal to 1.00 plus 

the value obtained by summing algebraically all values in row 


2b to the right of the diagonal, including the entry in the validities 
column. 


(n) Next, obtain the value in cell (3b, 3) by summing the prod- 
ucts of cell (3a, 1) times cell (1b, 3) and cell (3b, 2) times cell (2b, 3) 
and subtracting this sum from cell (3a, 3). 

(3b, 3) = 1.02 — [(.78)(.53) + (.02)(.10)] = .61 


(0) For a four-factor problem the value in cell (4b, 3) is ob- 
tained by: 


(4b, 3) = (4a, 3) — [(4a, 1) (1b, 3) + (4b, 2)(2b, 3)] 
For an n X n matrix, 

(nb, 3) = (na, 3) — [(na, 1)(1b, 3) + (nb, 2)(2b, 3)] 
(p) For a four-factor problem, 


(2b, Check) = 


1.10 


(4b, 3) 
b, 4) = 
(3b, 4) (3b, 3) 
For an n X n matrix, 
(nb, 3) 
3b = 
(3b, n) (3b, 3) 


(q) Next obtain the value in cell (3b, Valid.) by subtracting 
from cell (8a, Valid.) the sum of the products of cell (3a, 1) times 
cell (1b, Valid.) and cell (3b, 2) times cell (2b, Valid.) and dividing 
by the value in cell (3b, 3). 


85 — [(.78)(.42) + (.02)(.00)] _ 


(3b, Valid.) = ra 


85 
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(r) For a four-factor problem, the value in the (4b, validity) 
cell is obtained as follows: 


m Valid.) — [(4a, 1)(1b, Valid.) | 
. + (4b, 2)(2b, Valid.) + (4b, 3)(3b, Valid.)] 
(4b, Valid.) = Gb, 4) 

For an n-factor problem, 
(nb, Valid.) 


(na, Valid.) — [(na, 1)(1b, Valid.) + (nb, 2)(2b, Valid.) 
| +---+ (nb, n — 1)(n — 1b, Valid.)] | 
(nb, n) 

(s) Next, obtain the value in cell (3b, Check) by summing the 
products of cell (3a, 1) times cell (1b, Check) and cell (3b, 2) times 
cell (2b, Check) and subtracting this sum from cell (3a, Check) 
and dividing by the value in cell (3b, 3). 


(3b, Check) = 2.78 — [(.78)(2.09) + (.02)(1.10)] 


Zl 1.85 
Completed Factor Estimation Matrix 
(t) For a four-factor problem: 
# Check) — [(4a, 1)(1b, Check) 
(4b, Check) = + (4b, 2)(2b, Check) + (4b, 3)(3b, Check)] 


(4b, 4) 
For an n-factor problem, 


(nb, Check) 
jón Check) — [(na, 1)(1b, Check) + (nb, 2)(2b, cated 
+- --+ (nb, n — 1)(n — 1b, Check)’ 
(nb, n) 
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(u) The general outline of the procedure in step 5 for larger 
problems is as follows: 


(1) Obtain the (nb, n) value. In a four-factor problem, 


(4b, 4) = (4a, 4) — [(4a, 1)(1b, 4) + (4b, 2) (2b, 4) + (4b, 3)(3b, 4)] 
And for an n-factor problem, 


(nb, n) = (na, n) = [(na, 1)(1b, n) 
+ (nb, 2)(2b, n) +-+++ (nb, n — 1)((n — 1)b, n)] 


(2) Obtain the values in each column (calling the column m, 
where m < n) below the diagonal cell. The general 
formula is: 

(nb, m) = (na, m) — [(na, 1)(1b, m) 
+ (nb, 2)(2b, m) +----+ (nb, m — 1)((m — 1)b, m)] 

(3) Obtain the values in row nb to the right of the diagonal 


cell and to the left of the validity column. The general 
formula for cell (nb, o) where o >n is: 


(ob, n) 

(nb, n) 

(4) Obtain the value in the validity column by means of the 
general formula in step 5r. 

(5) Obtain the value in the check column by means of the 
general formula in step 5t. 

(6) Check the work by obtaining the algebraic sum of all 
values in the row to the right of the diagonal cell plus 1.0. 
This sum should equal the value in the check column. 


(nb, 0) = 


6. (a) The loading of the test (t) on the third or last factor is 
equal to the value in cell (3b, Valid.). 


Fin = .85 (or .8 rounded if the loadings 


are being reported to one decimal place) 


(b) The loading on the second factor is equal to cell (2b, 


Valid.) minus the product of cell (2b, 3) times the third factor 
loading. 


Fy = .00 — (.10)(.85) = —.08 (or —.1 rounded) 
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(c) The loading on the first factor is equal to cell (1b, Valid.) 
minus the sum of the products of cell (1b, 3) times the third factor 
loading, and cell (1b, 2) times the second factor loading. 


Fy = 42 — [(.53)(.85) + (.14)(—.08)] = —.02 (or .0 rounded) 


(d) For problems with more than three factors, the process is 
similar to that outlined above. The last factor loading is the 
value in the last b row of the Validities column. Computation of 
each successive loading involves subtraction of an additional cross- 
product, as illustrated in steps 6b and 6c above. The general 
formulas for an n-factor problem are: 


Fy = (nb, Valid.) 
Fy—1 = [(n — 1)b, Valid.] — [(n — 1)b, nJFy 
Fy— = [(n — 2)b, Valid.] — [(n — 2)b, n]Fy 
+ [(n — 2)b (n — 1) Fy 
Fy = (2b, Valid.) — [(2b, n)Fy + 2b (n — 1)Fy_4] 
bese by 3) For + 
F, = (1b, Valid.) — [(1b, n)Fy + 1b (n — 1)Fy_4] 
+--+ (1b, 3)Fim + (1b, 2)Fu 
7. If the factor loadings of a criterion variable rather than a 
test variable are to be estimated, the validity coefficients of the 


criterion for the tests of known factorial content are listed in 
column ¢ of the rotated factor matrix. 

8. If the factor loadings of more than one variable are to be 
estimated relative to the same rotated factor matrix, they may be 
estimated from the same basic factor estimation matrix (i.e., the 
factor estimation matrix found in step 2g. Also see footnote 1). 

9. It is useful as a last step to reproduce from the estimated 
factor loadings the correlations of the variable not factor-analyzed 
with the variables that were used in the factor analysis. Repro- 
duced and original correlations for the present example are com- 
pared on page 219. Discrepancies are not due to computational 
errors if work has been checked. Significant discrepancies may be 
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the result of an inadequate factor description of the enlarged 
matrix that includes the additional test or criterion. Discrepan- 
cies the size of the ones in this example might suggest the presence 
of one or more additional factors in the enlarged matrix. 


Added 
Variable Original Reproduced 
with Correlation Correlation 
1 a 6 
2 4 4 
3 ra 2 
4 al 2 


10. The estimated factor loadings obtained from this method 
can often be improved by iteration. One such method of iteration 
is presented by Wherry (1949) and Wherry, Campbell and Perloff 
(1951). 
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ANSWERS TO PROBLEMS: 


CHAPTER 2 
1. Two clusters 


B (7, 8, 3, 6) = 2.274 
B (1, 2, 4, 5) = 2.076 


2. Graph reveals the same two clusters as were obtained in Prob- 
lem 1 


CHAPTER 3 
1. ] 57 .66 .60 
.33 .29 .20 
68 .89 .75 
la 
2. .461 2.000 —.769 


1.333 —2.000 .000 
={1.589 2.000 1.538 


Aw 


CHAPTER 4 
1. .66 
2. Test 2 
3. .44 


1Since there are no unique answers to most factor analysis problems, these 
values are to be regarded as suggested solutions. 
267 
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4. Correlations 
Variables 1 2 3 4 
1 (66) .52 .46 
2 -52 (56) .38 .20 
3 .46 .38 (85) .75 
4 .41 .20 .75 (84) 
5. Test 3 


6. Increase the loading on factor II and decrease the loading on 
factor IV. 


CHAPTER 5 


1. Two factors as follows: 


Factor 


1 | 7616 .0000 
2 | .8272 © .3549 
3 | .6040 -.3935 
4 | .6696 -.0372 
5 | .4596 .1967 
6 | .2363 -.5516 
7 | 5909 ~-.6165 
8 | 3151 -.7356 


3. A clockwise rotation of 66° 50’ gives the following loadings: 


1 
2 
3 
4 
5 
6 
7 
8 


4. The algebraically derived values should agree closely with those 
obtained graphically. 
5. The centroid loadings, using the highest correlation or residual 


correlation in each column as an estimate of the communality, are as 
follows: 


Answers to Problems 269 


MNO UKWN = 


Tucker’s phi for factor II is .158 and for factor III it is .778. The 
factoring should be repeated once or twice for this small matrix to 
Stabilize the communalities. 


O©OMN DA VUAWN = 


3 
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Variable 


@Onouasawn = 


o Noua Wn = 


V-Matrix 
Factor 


CHAPTER 6 


Answers to Problems 


7.64 
3.57 


3.57 
8.28 


1.000 
-448 


.448 
1.000 


1.000 -000 
-.500 1.120 


Answers to Problems 


2. 


r Factor 2 
Variable A 
3 -7344 ~-.0888 | .547 
4 .5435 -.3706 | .433 
7 .8260 1275 | .699 
8 -7081 -3867 | .651 
CHAPTER 7 
1. 


Dyn aurk WD = 


1 12 22 53 
2 | -03 31 67 
3 “o -04 84 
4 11 .77 17 
5 36 80 07 
6 19 72 44 
? 76 .01 ee 
8 | .82 -14 © -.04 
9 76 .20 o8 


3 
a 
3 
Q 
© 
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CHAPTER 8 


w 


Rotated Factor 


1 53 35 00 
2 65 35 =U? 
3 85 7 -.08 
4 16 -80 08 
5 08 -84 30 
6 43 -82 10 
? 31 16 69 
8 07 =.04 81 
9 

10 


CHAPTER 9 


1. The rotated factors are identified as a numerical-facility factor 
(with tests 1, 2, and 3 loaded on it), a verbal-comprehension factor 


(with tests 4, 5, and 6), and an academic-achievement factor in educa- 
tional psychology (with tests 7, 8, 9, and 10). 
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Abilities, 2, 11, 111 
Achievement factors, 197 
Adjutant General’s Office, 206 
Analysis of variance, 3 

Andree, R. V., 28 

Aptitude factors, 197 

Sasi tia of factor analysis, 


additive, 46 
correlations due to common fac- 
tors, 47 
Attention, 11 
Axes 
Cartesian, 194 
coordinate, 38, 41 
long, 123 
oblique, 107 
primary, 137, 140, 192, 193 
reference, 38 
simple, 146, 192 


B-coefficient, 13-16 
Basic equation of factor analysis, 
_ _ 44 (footnote), 48 
Bi-factor method, 10, 11, 109 
Bipolar 
factors, 111 
simple structure, 112 
Brannan, M., 206 
Burt, C., 104, 207, 208 


C-matrix, 177 
Campbell, J. T., 219 
Carroll, J. B., 155, 207 
Cartesian axes, 194 


Categories 
classificatory, 1 
diagnostic, 2 
Cattell, A. K. S., 186 
Cattell, R. B., 13, 113, 186, 191, 
198 
Centroid formula for estimating 
communality, 166 
Centroid method of factoring, 59f. 
computation of first loadings, 61 
eleven-variable example, 71f. 
general rules for signs of load- 
ings, 66 
reflection of variables, 74 
residual correlation matrix, 73 
Checks 
on centroid loadings, 68, 84 
for diagonal loadings, 58 
importance of, 62 
on oblique simple structure 
loadings, 147 
for reflection, 65 
on rotated factor loadings, 127 
on rotation, 117 
for transformation matrix, 126 
Chi-square test for sufficient fac- 
tors, 206 
Clinical psychology, 2, 186 
Cluster analysis, 12-17 
comparison with factor analysis, 


12 
methods of, 13 
purposes of, 12, 13 
steps in computation, 14, 15 


Clusters of test vectors, 193 
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Columbia University, 206 
Common 
factors, 68 
traits, 2 
variance, 2, 45, 47 
Communality, 9, 35, 40, 41 (foot- 
note) 
defined, 47, 51 
estimated by centroid method, 
166 
estimated by highest r, 61 
re-estimated, 63 
sign of, 61 
Computation, mechanical aids to, 
205 


Conditioned response, 156 
Conditions of administration, 199 
Configuration 
bipolar simple structure, 112 
complete triangular, 112 
incomplete triangular, 113 
isolated constellations, 111 
of test vectors, 38, 41, 107, 178 
Coombs’ criterion, 80 
Correlation coefficients 
geometrical representation of, 
31, 32, 42 
interpretation of, 4 
part-whole, 169 
reproduced from factor load- 
ings, 35, 37, 48, 49 
spurious, 169, 201 
tetrachoric, 179, 201 
types of, 201 
Correlation of persons, 176 
Correlations between tests calcu- 
lated from factor loadings, 11 
Cosines of angles, 177 
Cowles, J. T., 212 (footnote) 
Cowles-Crout method, 213 (foot- 
note) 
Crawford, I., 206 
Criteria for rotation 
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parallel proportional profiles, 
114 


positive manifold, 110 
simple structure, 109 
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Deemer, W. L., 28 
Degan, J. W., 176 
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defined, 21 
evaluation of, 21 
of a matrix, 21 
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Difficulty level, 200, 207 
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goal of, 60 
signs of correlations in, 66 
steps in, 64 
of test vectors, 42, 60, 62, 64 
of variables, 74 
Reliability 
coefficient, 47, 51 
as diagonal entry, 51 
effect on factor loadings, 202 
Reporting a factorial study, 153 
Reproduction of correlations 
from cross products, 35, $7, 48 
(footnote) 
from factor loadings, 35, 37, 48 
Research programs, proper order 
for, 2 
Residual correlations, 59, 73 
centroid-factor, 59, 62, 67 
computation of, 62-63 
geometrical interpretation of, 
59 


second and subsequent, 67 
Reversals, sign (see also Reflec- 
tion), 42, 59 
Rhymer, R. M., 186 
Rimoldi, H. J. A., 173 
Rorschach test, 201 
Rotated factor loadings 
calculation of, 127 
checks on, 127 
Rotation of axes, 38, 106 
analytical, 41, 207 
checks for, 117, 120 
clockwise, 39 
counterclockwise, 39 
errors in, 117 
graphical, 41, 114 
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